06-11-2023 · Insight

The energy challenge of powering AI chips

Information processing isn’t the only thing artificial intelligence amplifies, it also intensifies energy consumption and heat generation. To keep pace and keep cool, data centers will need massive upgrades, creating strong structural drivers and potentially recession-proof revenue growth for companies supplying energy-efficient equipment and energy management systems.

    Authors

  • Michael Studer PhD - Co-Portfolio Manager Smart Energy

    Michael Studer PhD

    Co-Portfolio Manager Smart Energy

The pace of artificial intelligence (AI) software adoption is one of the fastest adoption curves markets have ever seen. The large language models (LLM) used by ChatGPT and similar AI bots to generate humanlike conversations are just one of many new AI applications that rely on ‘parallel computing’, the term used to describe the massive computational work performed by networks of chips that carry out many calculations or processes simultaneously.

At the core of the AI infrastructure are GPUs (graphic processing units) that excel at the type of specialized, high-performance parallel computing work required by AI. All that processing power also results in higher energy inputs that generate more heat output compared to their CPU (central processing units) counterparts used in PCs. See Figure 1.

Figure 1 - Core comparison – CPUs vs GPUs

Figure 1 - Core comparison – CPUs vs GPUs

Source: GPU vs CPU – Difference Between Processing Units – AWS (amazon.com)

Small chips, big impact on data centers

High-end GPUs are about four times more power dense than CPUs. This constitutes significant new problems for the data center planning as the originally calculated power supply is now only 25% of what is needed to run modern AI data centers. Even the cutting-edge, hyperscaler data centers used by Amazon, Microsoft and Alphabet for cloud-based computing are still CPU-driven. To illustrate, Nvidia’s currently supplied A100 AI chip has a constant power consumption of roughly 400W per chip, whereas the power intake of its latest microchip, the H100, nearly doubles that – consuming 700W, similar to a microwave. If a full hyperscaler data center with an average of one million servers replaced its current CPU servers with these types of GPUs, the power needed would increase 4-5 times (1500MW) – equivalent to a nuclear power station!

GPUs are around four times more power-dense than CPUs

Open your portfolio to the power of themes

For over 25 years, Robeco has been a pioneering leader in constructing thematic strategies.


Discover more

This increase in power density means that these chips also generate significantly more heat. Consequently, the cooling systems also have to become more powerful. Power and cooling changes of these magnitudes will require totally new designs for future AI-driven data centers. This creates a huge demand-supply imbalance on the underlying chip and data center infrastructure. Given the time it takes to build data centers, industry experts project that we are in the first innings of a decade-long modernization of data centers to make them more intelligent.

Figure 2 - Growth in power consumption of US data centers (gigawatts)

Figure 2 - Growth in power consumption of US data centers (gigawatts)

Includes power consumption for storage, servers and networks. Gray indicates energy consumption of enterprise data centers with vertically, light blue ‘co-location’ companies that rent and manage IT data center facilities on behalf of companies, dark blue indicates hyperscaler data centers.

Source: McKinsey & Company, Investing in the rising data center economy, 2023.

Operation overhaul – Equipping data centers for AI needs

Structural changes of this magnitude will lead to massive upgrades of not only chips and servers but also the electric infrastructure that supplies them with the power to operate.

The laws of physics dictate that increasing power supply means increasing electric voltage and/or current (Power = voltage x current). Increasing voltage is the more viable option.1 Accordingly, the industry is working on increasing the voltage, which requires redesigning many standard components that were defined in the early era of computers and servers, when power densities were relatively low (2-3kW per rack). This means new configurations for power cords, power distribution racks and converters as they exceed the current formats (See Figure 3). On a chip level, even bigger challenges exist, as higher voltage and therefore more power for the GPU requires a full redesign of the chip power layout.

Figure 3 – More than a cable and plug – The complexity of data center electrical systems

Figure 3 – More than a cable and plug – The complexity of data center electrical systems

Powering a data center involves multiple interconnected systems that must function seamlessly for optimal processing performance. LV = Low voltage devices, MV = Medium voltage devices

Source: Green Data Center Design and Management

Data center cooling is key to ensure high system performance and prevent malfunctioning. Traditional HVAC2 solutions that use air conditioning and fans to cool the air in data center server rooms are sufficient for CPUs whose server racks manage power densities between 3-30 kW, but not for GPUs whose power densities easily go beyond 40kW.3 As the latest GPU racks exceed these power levels, additional liquid cooling is (once again) at the forefront. They allow for even higher heat dissipation on a server rack or chip level because liquids have a greater capacity to capture heat per unit volume than air. Some of the biggest challenges for liquid cooling however are 1) the lack of standardized designs and components for such systems, 2) different technology options such as chip or rack cooling, and 3) high costs for pipes and measures to prevent leakage.

Our outlook on the AI revolution and its impact on data centers’ energy requirements

The AI revolution will require nothing short of a complete re-engineering of the data center infrastructure from the inside out to accommodate for the much higher energy needs of new AI technology. This will result in a strong boost in demand and investments for data centers on low-power computing applications, energy-efficient HVAC and power management solutions – all of these Big Data energy efficiency solutions are key investment areas for the smart energy strategy.

The AI revolution will require nothing short of a complete re-engineering of the data center infrastructure from the inside out

Revenues for companies supplying energy efficiency solutions to data centers are expected to show strong growth. This has also led to a re-rating in the market with these companies now trading more on the upper end of their historical range on the back of the much stronger growth outlook. Given the strong momentum and underlying structural drivers, we fundamentally really like this part of the smart energy investment universe. We also believe that this area can decouple from a potential recession, as spending on much-needed data centers and related energy efficiency solutions will not be dependent on the economic cycle.

Footnotes

1 Increasing the current requires thicker wires and costs valuable space – not a viable option for the tightly packed server rack layout of today’s data centers.
2 Heating, ventilation and air conditioning
3 According to Schneider Electric, GPU server rack power density levels average 44kW

Let's keep the conversation going

Keep track of fast-moving events in sustainable and quantitative investing, trends and credits with our newsletters.

Don’t miss out
Robeco

Robeco aims to enable its clients to achieve their financial and sustainability goals by providing superior investment returns and solutions.

Important information This disclaimer applies to any documents and the verbal or written comments of any person in presentations or webinars on this website and taken together is referred to herein as the “Information”. The services to which the Information relate are NOT FOR RETAIL CLIENTS - The information contained in the Website is solely intended for professional investors, defined as investors which (1) qualify as professional clients within the meaning of the Markets in Financial Instruments Directive (MiFID), (2) have requested to be treated as professional clients within the meaning of the MiFID or (3) are authorized to receive such information under any other applicable laws and must not be relied or acted upon by any other persons. This Information does not constitute an offer to sell, or a solicitation of an offer to buy, any financial product, and may not be relied upon in connection with the purchase or sale of any financial product. You are cautioned against using this Information as the basis for making a decision to purchase any financial product. To the extent that you rely on the Information in connection with any investment decision, you do so at your own risk. The Information does not purport to be complete on any topic addressed. The Information may contain data or analysis prepared by third parties and no representation or warranty about the accuracy of such data or analysis is provided.
In all cases where historical performance is presented, please note that past performance is not a reliable indicator of future results and should not be relied upon as the basis for making an investment decision. Investors may not get back the amount originally invested. Neither Robeco Institutional Asset Management B.V. nor any of its affiliates guarantees the performance or the future returns of any investments. If the currency in which the past performance is displayed differs from the currency of the country in which you reside, then you should be aware that due to exchange rate fluctuations the performance shown may increase or decrease if converted into your local currency. Robeco Institutional Asset Management B.V. (“Robeco”) expressly prohibits any redistribution of the Information without the prior written consent of Robeco. The Information is not intended for distribution to, or use by, any person or entity in any jurisdiction or country where such distribution or use is contrary to law, rule or regulation. Certain information contained in the Information includes calculations or figures that have been prepared internally and have not been audited or verified by a third party. Use of different methods for preparing, calculating or presenting information may lead to different results. Robeco Institutional Asset Management UK Limited (“RIAM UK”) is authorised and regulated by the Financial Conduct Authority. RIAM UK, 30 Fenchurch Street, Part Level 8, London EC3M 3BD (FCA Reference No:1007814). The company is registered in England and Wales under Ref No. 15362605.

In all cases where historical performance is presented, please note that past performance is not a reliable indicator of future results and should not be relied upon as the basis for making an investment decision. Investors may not get back the amount originally invested. Neither Robeco Institutional Asset Management B.V. nor any of its affiliates guarantees the performance or the future returns of any investments. If the currency in which the past performance is displayed differs from the currency of the country in which you reside, then you should be aware that due to exchange rate fluctuations the performance shown may increase or decrease if converted into your local currency. Robeco Institutional Asset Management B.V. (“Robeco”) expressly prohibits any redistribution of the Information without the prior written consent of Robeco. The Information is not intended for distribution to, or use by, any person or entity in any jurisdiction or country where such distribution or use is contrary to law, rule or regulation. Certain information contained in the Information includes calculations or figures that have been prepared internally and have not been audited or verified by a third party. Use of different methods for preparing, calculating or presenting information may lead to different results. Robeco Institutional Asset Management B.V. is authorised as a manager of UCITS and AIFs by the Netherlands Authority for the Financial Markets and subject to limited regulation in the UK by the Financial Conduct Authority. Details about the extent of our regulation by the Financial Conduct Authority are available from us on request.