Data Center Deep Dive: Insights on Transformative Role of Nvidia's Blackwell Chips

OpenAI introduced a different type of model, o1 (a.k.a.), which runs parallel to GPT-4 but is fundamentally distinct. The o1 model is important because it is not just a one-off model but a new paradigm for training models.

The key difference between GPT-4 and o1 is that the latter can reason. The model spends a few minutes thinking before generating an answer. To do this, an additional step of Inference Compute is required, which will increase the demand for compute significantly.

Current AI models are trained on a data set and infer or generate conclusions based on that data set. While this works well for many applications (e.g., search), the models hit a plateau and don't improve. The only way to progress is to train a new generation of models.

In the future, as compute costs decline, companies will start introducing models that can reason and continuously learn. The applications for these will be anything that requires real-time response and accurate action. The o1 model is just one step in that direction. Continuous learning would take thinking to the next level.

Nvidia's CEO, Jensen Huang, highlighted this paradigm shift in a keynote at Stanford this summer: "Today, we learn and apply (train => inference); in the future, we will have continuous learning."

Note that inference today is a large market (we estimate >50%), but Inference Compute is a whole new use case.

Generative AI

Lay of the Data Center Land

In our recent webinar, we explored the evolving landscape of the AI data center market. For those who missed it, here’s a concise overview of our findings.

The AI data center value chain is split into two parts: inside the rack and outside the rack. Processors, Networking, and Memory are key areas inside the rack. Thermal Management, Power Management, and Manufacturing Equipment are key areas outside of the rack.

Inside the Rack

Market Overview and Growth

The key driver for Data Center investment has been capex spending by the Cloud Service Providers (CSPs). Capex growth surprised to the upside this year and we expect it to surprise again next year as companies are starting to provide qualitative commentary this quarter.

Top 3 Cloud Vendor Capex

In addition to the cloud service providers, enterprises and governments are now coming to the market with incremental demand. As an example, xAI's data center cluster, Colossus (a 100K H100 GPU cluster), just came online and is expected to double in size in a few months. Moreover, per Elon Musk's comment, xAI has 300 B200s on order for 2025.

This is just one example. Every other model provider will have to follow if they want to stay competitive.

Who stands to benefit? Let's dive into the value chain.

In The Rack

Processors

Processors are the largest market segment, representing >60% of the total market, and are expected to grow at a 40% CAGR from 2023 to 2027. Some companies design the chips (e.g., Nvidia, AMD (NASDAQ:AMD)), which is an asset-light business model, and others manufacture them (e.g., TSMC), which is an asset-heavy business model.
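To give a sense of what a 40% CAGR implies, here is a minimal sketch that compounds a base value over the four years from 2023 to 2027. The function name and the normalized base of 1.0 are assumptions for illustration only; the 2023 market size in dollars is not stated here.

```python
def compound_growth(base: float, cagr: float, years: int) -> float:
    """Return the value of `base` after compounding at `cagr` for `years` years."""
    return base * (1.0 + cagr) ** years


# Normalized 2023 base of 1.0 (hypothetical; no dollar figure is given in the text).
multiple = compound_growth(1.0, 0.40, 2027 - 2023)
print(f"Implied 2027 size relative to 2023: {multiple:.2f}x")
```

Under these assumptions, four years of 40% growth leave the segment at a bit under four times its 2023 size.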

Market Size

Nvidia is the leader on the chip design side with over 80% market share, but we expect, over time, both custom chips (i.e., chips developed by the hyperscalers) and AMD to be able to capture market share.

What is driving Nvidia's advantage?

The main point that people don't appreciate is that software is not automatically accelerated by a GPU: accelerated computing requires algorithms to be redesigned in order to use a GPU effectively. This is where Nvidia differentiates itself with over 400 CUDA libraries, which deliver dramatically higher performance than alternatives.

Consequently, although at face value Nvidia's and AMD's chips look comparable, most customers are often unable to get similar utilization out of AMD or custom chips. There is a subset of more sophisticated customers that build their own software and can get an even more attractive total cost of ownership (TCO) from competitors than from Nvidia's chips, but they have to do the heavy lifting.

Examples are established tech companies like Meta (NASDAQ:META) & Microsoft, which are AMD's customers, and Databricks, an AWS custom chips customer.

In addition, with Nvidia's product innovation cycle shortened to one year, it is challenging for competitors to keep up with the development timeline. At the current utilization that customers are getting, AMD is considered to be a year behind Nvidia.

Despite Nvidia's product being superior, there is room for competitors due to capacity constraints and supply security. Some thoughts on the competitive landscape:

On the custom side (ASICs), companies like Broadcom (NASDAQ:AVGO) and Marvell (NASDAQ:MRVL) work with major Cloud Service Providers (CSPs) to design chips in-house.
Broadcom's CEO recently stated that over time, he expects that most of the CSP market will be custom chips. For background, CSPs generally represent 50% of the market, with Enterprise the other 50%.
We estimate that custom chips will take only half of the CSP market (1/4 of the total) because, in many cases, customers specifically ask for an Nvidia chip if they plan to leverage the above-mentioned CUDA libraries.
Another formidable competitor is AMD. The company is particularly well positioned when Nvidia is out of capacity, as its chips are comparable in performance but require extra work. We expect that AMD will be able to capture a mid-single-digit (MSD) market share in the near term and ~10% in the medium term, which is meaningful ($500B '28 TAM vs. $4.5B '24 revenue base).
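The share arithmetic in the points above can be checked with a short sketch. All inputs are the figures quoted in the text; the variable names are illustrative, and applying the ~10% share to the full 2028 TAM is a simplifying assumption rather than a forecast.

```python
# Inputs taken from the text above.
CSP_SHARE_OF_MARKET = 0.50       # CSPs as a share of the total market
CUSTOM_SHARE_OF_CSP = 0.50       # estimated custom-chip share of CSP demand
TAM_2028_USD_B = 500.0           # 2028 total addressable market, in $B
AMD_REVENUE_2024_USD_B = 4.5     # AMD's 2024 revenue base, in $B
AMD_MEDIUM_TERM_SHARE = 0.10     # ~10% medium-term share

# Custom chips as a share of the whole market: half of half.
custom_share_of_total = CSP_SHARE_OF_MARKET * CUSTOM_SHARE_OF_CSP

# Assumption: the ~10% share is applied to the full 2028 TAM.
amd_implied_revenue = TAM_2028_USD_B * AMD_MEDIUM_TERM_SHARE
amd_multiple = amd_implied_revenue / AMD_REVENUE_2024_USD_B

print(f"Custom chips, share of total market: {custom_share_of_total:.0%}")
print(f"AMD implied revenue at 10% share: ${amd_implied_revenue:.0f}B")
print(f"Multiple of 2024 revenue base: {amd_multiple:.1f}x")
```

Under that assumption, a 10% share would imply roughly an elevenfold increase over the 2024 base, which is why even a modest share is described as meaningful.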

Networking

Networking is one of the areas with the highest growth potential and solid business models. Networking is expected to be a key driver behind achieving performance improvement for each generation of GPUs.

The key to networking is that its growth is a multiple of the growth in compute units, as they all need to be connected to each other.

Here are some highlights.

We expect this market to reach $100B by 2027. Key components of the networking market are:

Switches (Ethernet, InfiniBand) with key players Nvidia, Broadcom and Arista (NYSE:ANET).
Interconnects with key players Broadcom and Marvell.
Cables & Connectors with key players Amphenol (NYSE:APH) and TE Connectivity (NYSE:TEL).

While there are many specialized networking products, relatively few players compete in each specific technology, and they are therefore able to capture high margins.

Memory

Memory has historically been a more challenging and cyclical market due to competitive dynamics. AI requires High Bandwidth Memory (HBM), which is more complex to make, but the question is whether this time will be different from a competitive standpoint. The silver lining is that HBM takes up capacity from traditional Memory, tightening the market.

Key players in the memory market are SK Hynix, Micron (NASDAQ:MU) and Samsung (KS:005930). SK Hynix has a leadership position in HBM, but both Micron and Samsung are scaling their capacity in '25.

HBM Market Breakdown

Thermal and Power Management

Power Management is an important area both outside of the rack and as it extends to the grid. It is a relatively consolidated market, with companies like Eaton (NYSE:ETN) and Schneider commanding significant market share.

Due to the increased focus on thermal management, liquid cooling has emerged as a very attractive market.

Power and Thermal Mgmt

Equipment Providers

Further back in the value chain, there are the Semi-Equipment providers and Test and Measurement providers. People often miss that these sub-segments follow their own cycle (different from AI GPUs' cycle).

For example, manufacturing equipment had a strong capex cycle ('21-'23), and now customers are digesting the previously added capacity. In many cases, like in memory, the capacity for traditional products can be converted to make AI products, which, given the muted economy, results in more limited incremental demand. Even TSMC, a company that is leading the way in manufacturing, sees a muted capex cycle and guided to the lower end of its prior capex guidance.

As we get further into the AI opportunity and the $500B near-term opportunity, there will be a need for incremental manufacturing capacity, which will be positive for the equipment providers. But we are not there just yet.

Quantifying the AI Impact on Electricity Demand

One of the most significant medium-term bottlenecks for the Data Center build-out is power availability. One large data center consumes the equivalent of a small city, and by 2027 we expect that Data Centers will consume the equivalent of 40 million houses, or 1/3 of the US residential power market.

We are projecting that data centers will increase their share of power demand from approximately 3% in 2023 to over 10% by 2027.
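The power figures above can be cross-checked with simple arithmetic. This is a rough sketch using only the numbers quoted in the text; the variable names are illustrative, and the derived values are implications of those numbers, not independent estimates.

```python
# Inputs taken from the text above.
HOMES_EQUIVALENT_M = 40.0         # data center demand by 2027, in millions of houses
RESIDENTIAL_FRACTION = 1.0 / 3.0  # stated fraction of the US residential power market
SHARE_2023 = 0.03                 # approximate data center share of power demand, 2023
SHARE_2027 = 0.10                 # projected share by 2027 (stated as "over 10%")

# Residential base implied by "40 million houses = 1/3 of the market".
implied_residential_base_m = HOMES_EQUIVALENT_M / RESIDENTIAL_FRACTION

# How many times larger the data center share becomes between 2023 and 2027.
share_multiple = SHARE_2027 / SHARE_2023

print(f"Implied residential base: {implied_residential_base_m:.0f}M house-equivalents")
print(f"Share of power demand grows about {share_multiple:.1f}x")
```

The share multiple is a lower bound on growth in absolute consumption only if total power demand does not shrink over the period, which is an assumption not addressed in the text.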

Power Consumption
