Stacy Lu
Support Team
Feedback:
support@nextpcb.comThe relentless expansion of artificial intelligence (AI) and deep learning models has driven silicon manufacturers to push the limits of semiconductor technology. Modern AI accelerators—such as the NVIDIA H100, B200, and AMD MI300X—pack tens of billions of transistors into a single package. To power these ultra-high-density chips, hardware engineers must design robust, highly efficient Power Delivery Networks (PDNs) capable of delivering currents exceeding 1,000A at sub-1V core voltages (Vcore).
A poorly optimized PDN in high-speed AI designs can result in severe voltage ripple, ground bounce, electromagnetic interference (EMI), and ultimately, system instability or permanent hardware failure. This comprehensive guide details the core pillars of designing a high-performance PDN for AI accelerator cards, focusing on Voltage Regulator Module (VRM) design, decoupling capacitor strategies, layout optimization, and advanced simulation methods.
In high-speed computing, the Power Delivery Network (PDN) is the entire physical path through which electrical current travels from the power source (such as the main server power supply unit, or PSU) to the silicon die. On an AI accelerator card, the PDN includes the power connectors, motherboard copper planes, Voltage Regulator Modules (VRMs), decoupling capacitors, PCB vias, IC package substrates, and finally, the on-die metal layers.
Designing a PDN for an AI board is significantly more complex than standard motherboard design due to several compounding factors:
For a detailed breakdown of the structural components and hardware architecture of AI-focused computational systems, refer to our comprehensive guide on What Is an AI Server? Architecture, Components & PCB Requirements.
To construct a reliable power network, hardware engineers must first analyze the current and voltage characteristics of modern AI silicon. The primary power rail is the Vcore (or Vdd) which powers the main computing arrays (tensor cores and stream processors). Secondary rails include high-bandwidth memory supplies (Vdd_hbm), high-speed SerDes I/O supplies, and auxiliary analog rails.
The following table compares the typical power and PDN profiles of legacy computing GPUs versus modern AI training processors:
| Parameter | Legacy Compute GPU (e.g., Tesla V100 Class) | AI Training Processor (e.g., H100 / B200 Class) | Impact on PCB PDN Design |
|---|---|---|---|
| Total Design Power (TDP) | 250W - 300W | 700W - 1000W+ | Requires massive multi-phase power planes and liquid cooling. |
| Core Operating Voltage (Vcore) | 0.85V - 1.0V | 0.6V - 0.8V | Narrower noise margins; higher susceptibility to IR drop. |
| Maximum Core Current (Imax) | 300A - 350A | 900A - 1200A+ | Demands extreme copper thickness and multiphase VRMs. |
| Transient Slewing Rate (dI/dt) | ~150 A/μs | >1000 A/μs | Demands ultra-low loop inductance and high-frequency decoupling. |
| Target PDN Impedance (Ztarget) | ~1.0 mΩ | <0.1 mΩ (100 μΩ) | Requires ultra-thin dielectric layers and high-layer HDI. |
As shown in the table, the shift to sub-1V core voltages leaves little room for error. An IR drop (voltage loss due to PCB trace resistance) of just 20mV on an 0.8V rail represents a 2.5% loss in voltage, which can cause clock errors and chip resets. Managing this requires keeping both DC resistance and AC impedance exceptionally low.
This generational shift in board design requirements is analyzed further in our technical comparison: A100 vs. H100: GPU Generational Leap & PCB Stack Differences Explained.
The Voltage Regulator Module (VRM) on an AI accelerator card is responsible for stepping down the high DC voltage feed from the system (typically 12V or 48V) to the sub-1V Vcore level required by the processor. To output hundreds of Amperes efficiently, VRM designs must utilize multiphase buck topologies.
By splitting the total current load across multiple parallel phases, the thermal dissipation is evenly distributed across the PCB, and the ripple current on both the input and output filters is significantly reduced. A typical high-performance AI board may implement a 16-phase, 24-phase, or even 32-phase VRM system managed by advanced multiphase PWM controllers.
[ 12V / 48V DC Input ]
|
+----+----+----+
| | |
[Phase 1] [Phase 2] ... [Phase N] <-- Smart Power Stages (SPS) with Integrated MOSFETs & Drivers
| | |
[Ind. 1] [Ind. 2] ... [Ind. N] <-- Coupled Inductors
| | |
+----+----+----+
|
[Decoupling Network] <-- MLCCs, Tant-Polymers, SP-Caps
|
[AI GPU]
Given the ultra-high density of VRMs on accelerator cards, placing them close to the BGA contacts is vital. This density often dictates high-density interconnect (HDI) routing strategies. You can read more about how HDI stacks support these high-performance components in our guide to HDI PCB for AI Servers.
While the VRM handles low-frequency power demands, it cannot respond fast enough to high-frequency transients. When the processor switches states in nanoseconds, it pulls charge from the decoupling capacitors placed along the PDN. To maintain a flat impedance profile across a wide frequency range, designers must deploy a multi-tiered decoupling strategy.
The core objective of decoupling design is to keep the AC impedance (Ztarget) of the power network below a calculated maximum limit up to a specific cutoff frequency (usually between 50 MHz and 200 MHz, beyond which the package and silicon die-level capacitances take over). The target impedance is calculated using the following formula:
Ztarget = (ΔV × Ripple%) / Itransient
Where:
For example, if Itransient is 400A at 0.8V with a 3% ripple allowance:
Ztarget = (0.8 × 0.03) / 400 = 0.024 / 400 = 60 μΩ (micro-Ohms)
No single capacitor can provide low impedance across the entire frequency spectrum due to its internal parasitic elements: Equivalent Series Resistance (ESR) and Equivalent Series Inductance (ESL). Therefore, we mix different capacitor technologies:
| Capacitor Class | Frequency Coverage | Typical Values | Physical Placement | Primary Function |
|---|---|---|---|---|
| Bulk Capacitors (Tantalum Polymer, Aluminum SP-Caps) | Low Frequency (10 kHz - 500 kHz) | 150 μF - 470 μF | Peripheral to the VRM and BGA outer ring | Sinks low-frequency ripple from VRM switching and handles slow load steps. |
| Mid-Frequency MLCCs (0805, 0603 Package) | Mid-Frequency (500 kHz - 10 MHz) | 1 μF - 22 μF | Clustered around the edges of the socket cavity | Bridges the gap between bulk capacitors and sub-BGA decoupling. |
| High-Frequency MLCCs (0402, 0201, 01005 Package) | High Frequency (10 MHz - 100 MHz+) | 10 nF - 470 nF | Directly on the back of the PCB under the BGA pins (cavity MLCCs) | Provides instant charge to the die, bypassing board parasitic inductance. |
To maximize the effectiveness of high-frequency MLCCs, the mounting loop inductance must be kept as low as possible. This is achieved by utilizing via-in-pad technology, placing power and ground vias immediately adjacent to the capacitor terminals, and utilizing extremely thin dielectric prepregs in the stackup to maximize plane-to-plane capacitance.
During PCBA production, placing hundreds of tiny 0201 or 01005 decoupling capacitors within tight BGA cavities demands precision SMT placement. For insights into the PCB assembly challenges associated with high-density components on AI accelerators, consult our BGA Assembly Guide.
Given the tight tolerances and extreme currents of AI boards, physical prototyping without pre-layout and post-layout simulation is highly risky. Engineers utilize specialised EDA tools to perform comprehensive PDN analysis in two main domains: DC (IR Drop) and AC (Impedance Spectrum).
DC analysis focuses on the resistance of the metal planes and vias. It simulates the continuous high-current flow from the VRM outputs to the processor inputs to ensure:
AC simulation maps the impedance of the power distribution path over a sweeping frequency spectrum (typically from 100 Hz to 1 GHz). The primary goals are to:
Engineers can calculate and manage characteristic line parameters pre-layout by using interactive software tools like the NextPCB PCB Impedance Calculator to quickly establish basic stackup variables.
The physical layout of the PCB is where the electrical rules of the PDN are realized. For AI accelerator cards, which typically utilize 24 to 36 layers of high-grade copper, layout execution must be precise.
To transport 1000A+, traditional 1 oz copper planes are completely inadequate. High-power planes for AI Vcore must use 2 oz or even 3 oz copper thickness on inner layers. The stackup must be designed symmetrically to avoid board warpage during soldering, pairing thick power layers with matching ground planes separated by ultra-thin dielectric materials (often 1 to 2 mils thick) to maximize inter-plane capacitance.
[Layer 1: Signal (Top)] [Layer 2: Ground Plane (1 oz)] [Layer 3: V_core Power Plane (2 oz)] <-- Thin Core (1.5 mil) for high capacitive coupling [Layer 4: Ground Plane (2 oz)] [Layer 5: V_core Power Plane (3 oz)] <-- Main current transport layer [Layer 6: Ground Plane (2 oz)] ...
For more details on mitigating the massive thermal output of these dense power delivery sections, see our dedicated guide: Thermal Management on AI Server PCBs: Copper Coin, Thermal Vias, and Heatsink Integration.
Furthermore, because high-current PDNs exist adjacent to high-speed digital lanes like PCIe Gen 5/6 and 112G PAM4 lines, proper shielding is crucial. To avoid capacitive or inductive cross-coupling between noisy power switching lanes and sensitive differential pairs, follow the routing parameters outlined in 112G PAM4 PCB Design for AI Servers: Material Selection, Trace Routing, and SI Rules.
Manufacturing a physical PCB that meets the mathematical requirements of an AI accelerator's power delivery network is highly challenging. It requires tight registration of 30+ layer stackups, reliable high-ratio microvia plating, and heavy copper processing capabilities.
NextPCB is a leading manufacturer of advanced, high-layer count HDI boards designed to handle the most demanding power architectures in modern computing. Our capabilities include:
For a direct quote on your high-layer count hardware, visit our Advanced PCB Quote Portal, or submit your bill of materials (BOM) to our turn-key assembly services via the PCB Assembly Quote tool.
AI accelerator cards operate at extremely high thermal and electrical loads. Standard FR-4 materials have a relatively low glass transition temperature (Tg < 150°C) and higher dielectric loss factors (Df > 0.02). The high heat dissipated by VRMs and the processors can quickly degrade standard FR-4, leading to board warpage, delamination, or dielectric breakdown. High-Tg, low-loss materials (such as Megtron series) maintain mechanical strength and physical dimensions under high temperatures, while providing lower dielectric loss for adjacent high-speed signal routing.
Anti-resonance occurs when two different capacitors (e.g., a 10 μF bulk capacitor and a 0.1 μF MLCC) are connected in parallel. At a certain frequency, the inductive reactance of the larger capacitor matches the capacitive reactance of the smaller capacitor. At this resonant frequency, their reactances cancel out, creating a high-impedance peak in the PDN. This peak can cause substantial voltage ripple if the processor switches at that specific frequency. To prevent this, designers use simulation tools to carefully space capacitor values, select low-ESR parts, or introduce small damping resistors/ferrite beads to flatten out resonance spikes.
Plane-to-plane capacitance (also called inter-plane capacitance) is the distributed capacitance formed between adjacent power and ground planes in the PCB stackup. Because this capacitance is distributed uniformly across the entire plane, it has virtually zero parasitic loop inductance. It acts as an incredibly fast decoupling capacitor, supplying instantaneous charge to the silicon at frequencies above 100 MHz, where physical SMT decoupling capacitors become ineffective due to via and trace inductance.
No. While thermal relief pads (spoked connections) make manual soldering easier by limiting heat dissipation, they significantly increase trace resistance and loop inductance. For high-current power rails (Vcore), vias must connect directly and fully (solid connection) to the power and ground planes. This ensures maximum current delivery and minimal resistance, even though it requires more advanced heating profiles during the assembly reflow process.
Need to manufacture AI server PCBs? Get a quote from NextPCB → Get an Advanced PCB Quote Now
Still, need help? Contact Us: support@nextpcb.com
Need a PCB or PCBA quote? Quote now