Home
PCB Quote
Standard PCB Advanced PCB
Rev 0 PCBA
PCB Assembly
Rev 0 PCBA PCB Assembly Quote PCB Assembly Service PCB Assembly Capability PCB Stencil Service BOM Service Free Functional Testing
Components Sourcing
HQ Online Components BOM Tool
Gerber Viewer | DFM
Online Gerber Viewer HQDFM Design Analysis Software HQDFM User Manual
Capabilities & Services

NextPCB Capabilities

Standard PCB Capabilities Advanced PCB Capabilities PCB Assembly Capabilities

Capabilities by PCB Types

PCB Product Showsase Rigid PCBs Rogers PCB High-TG PCBs Heavy Copper PCBs HDI PCBs High-Speed PCBs High-Frequency PCBs Aluminum PCBs Copper-Core PCBs Ceramic PCBs Flex PCBs Rigid-Flex PCBs

Printed Circuit Boards

PCB Prototype Applicable Industries PCB Manufacturing Process Advanced PCB Materials

PCB Assembly

PCB Assembly Service PCB Stencil Service File Requirements PCB Assembly Guide IC Programming PCBA DFA BGA Assembly Capabilities Laser Labeling/Coding

Layer Buildup

Layer Stack-up Prepregs, cores, foils

SMD-Stencils

Laser Stencil

PCB Design-Aid & Layout

Layer Orientation BGA PCB Price Composition Printed Circuit Board Materials PCB Design & Layout Panel Creation Gold Fingers

Mechanics

V-Scoring Back drilling PCB milling

Surface

Via Covering Surface Finish Silkscreen Solder mask

Quality

E-Test X-RAY Design Rule Check A.O.I

Drills & Throughplating

Via-in-pad Blind & Buried Vias Annular Rings Side plating Plated Half-holes/Castellated Holes Plated-through Slots

Factory & Certificate

PCB Factory VR Visiting PCB Assembly Factory Show Certificate

New users: $30 off
24 hours Fast Turnaround
100% E-test & AOI

Free for 10pcs 50%OFF for 100pcs TURNKEY PCB ASSEMBLY
Tools & Resources
PCB Impedance Calculator PCB Stackups & Impedance PCB Trace Width Calculator AI Electrical Rule Check KiCad Resource Hub KiCad Version Converter NextPCB Accelerator Program Blog News
About Us
About Us Contact Us Why Us Feedback Help Center Payment Methods Shipping Methods

0
Support Team

support@nextpcb.com

0086-755-8364 3663

+86 13622941920
Feedback:
support@nextpcb.com

Blog / Why AI GPUs Require 30+ Layer HDI PCBs

Why AI GPUs Require 30+ Layer HDI PCBs

Q: How does HDI reduce the required layer count compared to through-hole-only designs?

HDI does not necessarily reduce total layer count—it adds HDI build-up layers. However, it dramatically increases the routing density that can be achieved with a given number of layers, and it reduces the disruption to signal layers caused by through-hole via anti-pad clearances. Without HDI, routing from the inner rows of a fine-pitch GPU BGA would require so many through-hole vias that the anti-pad clearances in adjacent planes would effectively disrupt the reference plane beneath the critical NVLink routing—degrading impedance control across the entire BGA escape zone.

Q: Which PCB fabricators can manufacture 30+ layer HDI boards for AI servers?

Production-capable 30+ layer HDI fabrication for AI server boards is concentrated among tier-1 fabricators with sequential lamination presses rated for low-loss specialty laminates (Megtron 7, Tachyon 100G), laser drill systems capable of 75 μm microvia diameter at the required stacking depth, controlled-depth backdrill CNC equipment, and fully automated layer registration verification. NextPCB is equipped to handle AI server board fabrication across this complexity range, from 16-layer H100 era boards through 32+ layer B200 and NVSwitch designs.

Posted: June, 2026 Last Updated: June, 2026 Writer: Julia Wu Share:

Introduction

A standard desktop motherboard uses 6–8 PCB layers. A high-end enterprise server board uses 10–14. A DGX H100 GPU baseboard uses 20–24. A dedicated NVSwitch board in a GB200 NVL72 rack uses 32–40 or more. This progression is not incremental—it reflects a fundamental change in the engineering demands that AI accelerator hardware places on printed circuit boards.

The question “why do AI GPU PCBs need so many layers?” has a precise engineering answer, and understanding it matters for anyone designing, specifying, or procuring PCBs for AI server infrastructure. Layer count is not a measure of quality or prestige—it is the outcome of four specific engineering requirements that converge in AI GPU baseboards in a way that they do not in any other class of commercial electronics. This article explains each of those requirements in detail.

Table of Contents
Introduction
What Drives Layer Count in a PCB?
The Four Engineering Drivers Behind 30+ Layers in AI GPU Boards
Driver 1: High-Speed Signal Routing Density
Driver 2: Power Plane Count and PDN Architecture
Driver 3: Fine-Pitch BGA Escape Routing
Driver 4: Reference Plane Integrity for Signal Integrity
Layer Count by Board Type
What Is HDI and Why Does AI Hardware Need It?
HDI Structure Types
Laser-Drilled Microvias
Via-in-Pad (VIPPO)
Any-Layer HDI (ELIC)
Anatomy of a 28-Layer AI GPU Baseboard Stackup
NVSwitch Boards: The Most Complex Case
How Layer Count Affects Signal Integrity
How Layer Count Affects Power Delivery
How Layer Count Affects Thermal Management
Manufacturing Implications of High Layer Count HDI
Sequential Lamination
Layer Registration
Yield and Cost
Layer Count Comparison: Standard vs AI GPU PCBs
FAQ

What Drives Layer Count in a PCB?

PCB layer count is determined by the need to route electrical connections between components in a way that satisfies all applicable constraints simultaneously. Those constraints are:

Routing density: The total length of traces that must be routed, divided by the available routing area per layer; more traces in the same board area require more layers
Signal integrity: High-speed differential pairs require dedicated routing layers with adjacent reference planes; they cannot share layers with other signal types or power without degrading performance
Power delivery: Each distinct power domain benefits from one or more dedicated copper planes; more power rails mean more plane layers
Crosstalk isolation: Parallel high-speed signals must be spaced adequately or placed on layers separated by ground planes to prevent energy coupling; this separation requires additional layers
BGA escape routing: Fine-pitch BGA packages require via structures that consume layer transitions, and the routing that escapes the BGA footprint occupies layers that cannot be simultaneously used for other purposes

In a standard server motherboard, these constraints lead to 10–14 layers. In an AI GPU baseboard, all of these constraints are pushed to their limits simultaneously—leading to layer counts that would have been considered impractical in commercial electronics a decade ago.

The Four Engineering Drivers Behind 30+ Layers in AI GPU Boards

Driver 1: High-Speed Signal Routing Density

The dominant driver of high layer count in AI GPU baseboards is the sheer number of high-speed differential pairs that must be routed between GPU packages and NVSwitch packages.

In a DGX H100 baseboard with 8 H100 GPUs and 4 NVSwitch 3.0 chips:

Each H100 GPU has 18 NVLink 4.0 links, each consisting of multiple differential pairs
Each link connects from the GPU to one of the four NVSwitch chips
Each NVSwitch 3.0 has 64 NVLink ports connecting to all 8 GPUs
The total number of NVLink differential pairs routed on the baseboard exceeds 2,000 individual traces

Each of these differential pairs must be routed as a controlled-impedance structure (100 Ω ± 5%) with:

An adjacent reference plane (ground) immediately above or below the routing layer
Minimum edge-to-edge spacing from adjacent pairs of 2× trace width to meet crosstalk specifications
No vias, plane splits, or other discontinuities directly beneath any active portion of the trace

At 100 Gb/s per lane (NVLink 4.0), these constraints cannot be relaxed. A 75 μm wide trace with 75 μm space between pairs (3 mil / 3 mil) can route approximately 8–10 differential pairs across a 100 mm wide routing channel per layer. Routing 2,000+ pairs across the board therefore requires a minimum of 6–10 dedicated NVLink signal routing layers, depending on board width and routing efficiency.

For B200 boards with NVLink 5.0 at 200 Gb/s per lane, the spacing requirements are tighter (lower crosstalk budget), and the number of NVLink links per GPU increases. NVSwitch 4.0 boards in the GB200 NVL72, routing NVLink 5.0 connections between 72 GPUs via 9 switch boards, push signal routing density to the extreme end of what current PCB manufacturing can achieve.

Driver 2: Power Plane Count and PDN Architecture

AI GPU baseboards manage many distinct power domains simultaneously. Each domain that requires a dedicated power plane (rather than sharing a plane with other domains) adds layers to the stackup. For a DGX H100 baseboard, the major power domains include:

Power Domain	Voltage	Current (per 8-GPU board)	Plane Requirement
GPU VCORE (per GPU)	~0.85–0.9 V	~500 A per GPU × 8 = 4,000 A total	Dedicated planes per GPU; high copper weight
GPU HBM VDDQ	~1.1 V	~50–100 A per GPU	Shared or dedicated; noise-sensitive
NVSwitch VCORE (per chip)	~0.8 V	~200–300 A per chip × 4 chips	Dedicated planes for each NVSwitch
NVLink I/O voltage	~1.0–1.1 V	High aggregate current	Separate from VCORE; noise isolation required
PCIe I/O voltage	~0.85–1.0 V	Moderate	Can share with other I/O rails if noise isolated
Management / BMC rails	3.3 V, 1.8 V, 1.0 V	Low current	Can share layers; low priority for dedicated planes
Ground reference	0 V	Return for all above	Multiple dedicated ground planes required

The practical minimum for the power and ground plane layer count is 8–12 layers on a fully loaded H100 baseboard. Combined with the 6–10 NVLink signal layers and the HDI build-up layers, this accounts for the majority of the total 20–24 layer count.

For B200 baseboards with higher GPU TDP (1,000 W vs 700 W) and more power rails per GPU (dual-die B200 has more distinct power domains than the single-die H100), the power plane count increases further, pushing total layer count toward 28–32.

Driver 3: Fine-Pitch BGA Escape Routing

GPU packages (SXM5 for H100, SXM6 for B200) and NVSwitch chips are large, fine-pitch BGA components. Routing signals out of the inner rows of a BGA footprint requires vias that transition signals from the pad layer to inner routing layers—consuming layer transitions that are not available for other routing until the via escapes the BGA keepout zone.

For a 60 mm × 60 mm BGA package with 0.65 mm ball pitch, the inner rows of pads can only be accessed by routing signals through the outer rows using vias. Each via:

Occupies a routing channel on the layer it connects from and to
Introduces a stub (if through-hole) that must be backdrilled or eliminated by using blind/buried laser vias
Consumes pad real estate on every layer it passes through (anti-pad clearance)

HDI via structures (laser-drilled microvias and via-in-pad) allow BGA escape to be distributed across multiple build-up layers, dramatically increasing the routing density that can be achieved beneath a fine-pitch BGA. Without HDI, a 64-row BGA package like NVSwitch 3.0 cannot be efficiently escaped in a board of practical thickness—the number of through-hole vias required would consume more routing channel space than the vias free up.

The HDI build-up layers required for BGA escape add 2–6 layers to each side of the core (1+N+1 to 3+N+3 configurations), directly increasing total layer count beyond what the core routing alone would require.

Driver 4: Reference Plane Integrity for Signal Integrity

At NVLink 4.0 and NVLink 5.0 speeds, every signal routing layer must have an unbroken reference plane (ground) immediately adjacent to it. The characteristic impedance of a differential pair depends on the distance to the reference plane and the dielectric constant of the material between them. If the reference plane is interrupted (by a cutout, a connector clearance, or a power plane transition), the characteristic impedance changes locally, creating a reflection that degrades signal quality.

This means that signal layers and reference planes must alternate strictly: signal layer, reference layer, signal layer, reference layer. A board with 10 dedicated signal routing layers therefore requires a minimum of 10 reference (ground) plane layers interspersed among them—and this count does not include the power planes required by Driver 2.

In practice, ground planes often serve double duty (as both the reference for one signal layer above and the reference for a different signal layer below), reducing but not eliminating the requirement for dedicated ground layers. In a well-designed 28-layer stackup, approximately 8–10 layers are dedicated ground planes, with the remainder split between signal routing and power planes.

Layer Count by Board Type

Board Type	GPU Generation	Typical Layer Count	Primary Driver of High Count
Consumer GPU add-in card (RTX 4090)	Ada Lovelace	8–12	PCIe Gen4, GDDR6X routing, moderate PDN
Data center inference GPU (L40S PCIe)	Ada Lovelace	12–16	PCIe Gen4, GDDR6 routing, no NVLink
A100 HGX baseboard	Ampere	14–18	NVLink 3.0, 6 × NVSwitch 2.0, HBM2e power
H100 HGX baseboard	Hopper	20–24	NVLink 4.0, 4 × NVSwitch 3.0, PCIe Gen5
MI300X OAM UBB	CDNA 3	16–22	Infinity Fabric, PCIe Gen5, 48 V PDN
B200 GPU baseboard	Blackwell	24–32	NVLink 5.0, PCIe Gen6 PAM4, 1,000 W PDN
NVSwitch 3.0 board (H100 era)	Hopper	28–36	All NVLink 4.0 fabric routing; no GPU PDN
NVSwitch 4.0 board (GB200 NVL72)	Blackwell	32–40+	NVLink 5.0 at maximum routing density; 400 W per NVSwitch chip

For detailed layer count analysis of specific GPU generations, see A100 vs H100: GPU Generational Leap & PCB Stack Differences Explained and NVIDIA Blackwell Architecture Explained.

What Is HDI and Why Does AI Hardware Need It?

HDI (High-Density Interconnect) refers to PCB technology that achieves higher routing density than standard through-hole via designs by using laser-drilled microvias, fine trace/space geometries, and sequential lamination build-up layers. HDI technology is not optional for AI GPU baseboards—it is a prerequisite for routing the BGA escape of GPU and NVSwitch packages within the board thickness and footprint constraints of a practical server form factor.

HDI Structure Types

HDI Type	Description	Layer Count Notation	Typical Use Case
Type I	One build-up layer on one or both sides; blind vias from outer layer to first inner layer; through-holes in core	1+N+1 (both sides) or 1+N (one side)	Moderate-density BGA escape; data center server motherboards
Type II	Two build-up layers per side; stacked or staggered microvias; through-holes in core	2+N+2	H100 HGX baseboard; fine-pitch BGA escape for SXM5 and NVSwitch 3.0
Type III	Three build-up layers per side; stacked microvias; through-holes in core	3+N+3	B200 baseboard; extreme BGA density; some NVSwitch boards
Any-Layer HDI (ELIC)	Every layer interconnectable via stacked filled microvias; no through-holes spanning full board thickness	All-layer microvia	GB200 NVL72 NVSwitch boards; maximum routing density at extreme layer counts

Laser-Drilled Microvias

Microvias are small-diameter (75–150 μm) vias drilled by laser rather than mechanical drill. They connect only two adjacent layers (blind vias) or span buried layers (buried vias), as opposed to through-hole vias that span the full board thickness.

Advantages for AI GPU PCBs:

No stub: Blind microvias do not create stubs below the signal layer, eliminating the need for backdrilling on microvia-connected signals. Through-hole vias on NVLink signal layers still require backdrilling to remove stubs, but replacing through-holes with microvias on critical paths eliminates this requirement entirely for those connections
Smaller anti-pad: Microvia anti-pads (the copper clearance zone in adjacent planes) are smaller than through-hole anti-pad clearances, reducing disruption to the reference planes adjacent to high-speed signal layers
Higher routing density under BGAs: Stacked microvias allow signal escape through multiple layers within a very small footprint, enabling routing beneath fine-pitch BGA packages where through-hole via anti-pad clearances would consume all available routing space

Via-in-Pad (VIPPO)

Via-in-pad places a microvia (or filled through-hole via) directly within a BGA pad. The via is filled with conductive or non-conductive epoxy, planarized, and cap-plated to present a flat, solderable surface. This technique is essential for GPU and NVSwitch BGA escape because:

It eliminates the routing channel that would otherwise be required to escape the via to an adjacent pad before connecting to an inner layer
It allows signal connections to be made at the pad itself, freeing the routing channels between pads for other signals
In very-fine-pitch BGA arrays (< 0.8 mm pitch), routing from pad to via is geometrically impossible without via-in-pad—there is simply no space for a trace between the pads

The manufacturing process for via-in-pad involves additional steps (epoxy fill, cure, planarization grinding, cap plating) and tighter process controls than standard via processing. See How GPU PCBs Are Manufactured: From Bare Board to Final PCBA for a detailed description of the via-in-pad manufacturing process.

Any-Layer HDI (ELIC)

Any-Layer HDI (also called ELIC—Every Layer Interconnect) takes HDI to its logical extreme: every layer in the stackup is interconnected by stacked, filled microvias. There are no traditional through-holes spanning the full board thickness. Signals can transition between any two layers using a stack of filled microvias, regardless of which layers are involved.

ELIC is the highest-complexity and highest-cost PCB structure in commercial production. It is reserved for applications where routing density requirements cannot be met with any other approach. In AI GPU hardware, ELIC is used on dedicated NVSwitch 4.0 boards in the GB200 NVL72, where the combination of 72 GPU NVLink 5.0 connections, 400 W per NVSwitch chip power delivery, and the physical constraints of the rack-integrated switch board format creates routing demands that exceed what 2+N+2 or 3+N+3 HDI can solve.

Anatomy of a 28-Layer AI GPU Baseboard Stackup

The following is a representative 28-layer stackup for a B200-class GPU baseboard with 2+N+2 HDI, illustrating how the layers are allocated among signal routing, power planes, and ground references:

Layer	Function	Material	Copper Weight
L1 (outer)	SMT pads; BGA cap-plated via-in-pad; low-speed signals	Build-up prepreg (RCC)	1 oz (ENIG finished)
L2	Ground reference (HDI build-up layer 1)	Build-up prepreg	1 oz
L3	BGA escape routing; NVLink 5.0 breakout (build-up layer 2)	Build-up prepreg	½ oz
L4	Ground reference plane	Core laminate (Megtron 7)	1 oz
L5	NVLink 5.0 signal routing (horizontal)	Megtron 7	½ oz VLP copper
L6	Ground reference plane	Megtron 7	1 oz
L7	NVLink 5.0 signal routing (vertical)	Megtron 7	½ oz VLP copper
L8	Ground reference plane	Megtron 7	1 oz
L9	GPU VCORE power plane	Megtron 6 (or standard)	2 oz
L10	Ground plane (power return)	Megtron 6	2 oz
L11	NVLink 5.0 signal routing (horizontal)	Megtron 7	½ oz VLP copper
L12	Ground reference plane	Megtron 7	1 oz
L13	NVLink 5.0 signal routing (vertical)	Megtron 7	½ oz VLP copper
L14	PCIe Gen6 signal routing + NVLink 5.0	Megtron 7	½ oz VLP copper
L15	Ground reference plane (board center)	Megtron 6	1 oz
L16	NVSwitch VCORE power plane	Megtron 6	2 oz
L17	Ground plane (NVSwitch power return)	Megtron 6	2 oz
L18	NVLink 5.0 signal routing (horizontal)	Megtron 7	½ oz VLP copper
L19	Ground reference plane	Megtron 7	1 oz
L20	NVLink 5.0 signal routing (vertical)	Megtron 7	½ oz VLP copper
L21	HBM / auxiliary power planes (split plane)	Megtron 6	1 oz
L22	Ground reference plane	Megtron 6	1 oz
L23	NVLink 5.0 signal routing (horizontal)	Megtron 7	½ oz VLP copper
L24	Ground reference plane	Megtron 7	1 oz
L25	Management signals; low-speed I/O	Core laminate	1 oz
L26	BGA escape routing (build-up layer, bottom)	Build-up prepreg	½ oz
L27	Ground reference (HDI build-up layer, bottom)	Build-up prepreg	1 oz
L28 (outer)	SMT pads; bottom-side component lands	Build-up prepreg (RCC)	1 oz (ENIG finished)

This representative stackup allocates: 8 NVLink 5.0 signal routing layers, 1 PCIe Gen6 layer, 2 BGA escape layers (HDI build-up), 10 ground/reference plane layers, 4 power plane layers (VCORE, NVSwitch, HBM), 1 management signal layer, and 2 outer pad layers. The Megtron 7 material is used on all NVLink and PCIe signal layers; Megtron 6 is used on power and ground planes where dielectric loss is irrelevant; build-up prepreg is used for the HDI layers.

NVSwitch Boards: The Most Complex Case

Dedicated NVSwitch boards—such as those used in the GB200 NVL72 rack—represent the most demanding PCB design problem in current commercial production. These boards carry no GPU packages; their sole function is to switch NVLink 5.0 traffic between all 72 B200 GPUs in the rack.

This specialization makes them simultaneously simpler (no GPU power delivery requirements) and more demanding (higher NVLink routing density per unit board area) than GPU baseboards:

Each NVSwitch 4.0 chip has 72 NVLink 5.0 ports, each requiring multiple differential pairs to be routed to the edge connectors that connect to GPU compute trays
Multiple NVSwitch chips per board multiply the total pair count; a single NVSwitch board in the GB200 NVL72 may route over 3,000 NVLink 5.0 differential pairs at 200 Gb/s per lane
The loss budget for each NVLink 5.0 channel is fixed by the NVLink specification; longer trace routing on the switch board consumes more of this budget, demanding the lowest-loss laminates available (Megtron 7, Df < 0.002) and HVLP copper foil throughout the NVLink signal layers
Any-layer HDI (ELIC) is required for NVSwitch 4.0 boards because the routing density beneath the NVSwitch package BGA, combined with the number of signal layers required, exceeds what stacked-microvia 3+N+3 HDI can achieve within the board thickness constraints

Layer counts of 32–40 or more are typical for NVSwitch 4.0 boards. These are among the most technically challenging PCBs in commercial production and are manufactured by fewer than a handful of qualified fabricators worldwide.

How Layer Count Affects Signal Integrity

Higher layer count is not automatically better for signal integrity—it is a necessary accommodation for the routing constraints that arise when signal density is high. Several signal integrity effects are influenced directly by layer count decisions:

Via stub length: In a thicker board (more layers), through-hole vias are longer, and their stubs are proportionally longer. A 6 mm thick 28-layer board has longer stubs than a 3.2 mm thick 16-layer board for the same via configuration. Backdrilling becomes more critical, and the depth accuracy requirement is more demanding because the absolute stub length permitted (< 10 mils for NVLink 4.0) represents a smaller fraction of the total board thickness
Anti-pad coupling: In a board with many closely spaced layers, the anti-pad clearances in ground planes for through-hole vias can create arrays of coplanar waveguide discontinuities that couple energy between adjacent via structures. This is mitigated by minimizing anti-pad size (only possible with tighter manufacturing tolerances) and by using HDI microvias in place of through-holes on critical signal paths
Dielectric thickness uniformity: In a 28-layer board with 5 lamination cycles, cumulative dielectric thickness variation can be larger than in a simple single-press board. Tighter control of prepreg thickness and press cycle parameters is required to maintain ± 5% impedance tolerance across the full board

For a detailed treatment of signal integrity requirements at NVLink and PCIe Gen5/6 speeds, see What Is NVLink? How NVIDIA's High-Speed GPU Interconnect Shapes PCB Routing and the AI Accelerator PCB Design Guide.

How Layer Count Affects Power Delivery

Power plane layers in a high-layer-count board have lower sheet resistance (more copper area available for the same current) and lower inductance (more planes means more capacitance between adjacent power and ground planes, which provides high-frequency decoupling). Both effects improve PDN performance:

Lower plane resistance: With multiple power layers available, the GPU VCORE plane can be distributed across two adjacent layers connected by vias, halving the effective sheet resistance and reducing DC voltage drop from VRM to GPU package. At 500 A per GPU, a 0.5 mΩ plane resistance difference translates to a 250 mV voltage drop difference—significant against a 0.85 V VCORE target
Distributed capacitance: Adjacent power-ground plane pairs form a parallel plate capacitor. A 28-layer board with multiple power-ground plane pairs distributed through the stackup has more distributed capacitance than a 12-layer board, improving high-frequency PDN impedance without adding discrete decoupling capacitors
PDN target impedance: H100 and B200 boards target < 0.15 mΩ from DC to 100 MHz at the GPU package. Achieving this requires careful PDN design that leverages the distributed capacitance of the multi-layer plane stack, VRM placement optimization, and a tiered discrete decoupling capacitor network. The additional layers that accommodate power planes are not merely routing conveniences—they are essential contributors to PDN performance

How Layer Count Affects Thermal Management

The relationship between layer count and thermal management is less direct than for signal integrity or PDN, but real:

Thermal via density: Higher layer count boards have more internal copper plane layers that can act as heat spreaders when connected by thermal vias beneath high-power components. A dense thermal via array (0.4–0.6 mm pitch) beneath a GPU package spreads heat laterally across the nearest copper plane and vertically through successive planes to the cold plate contact area. More internal planes provide more horizontal spreading capacity
Board thickness and CTE: A 28-layer board is thicker than a 14-layer board (approximately 4–6 mm vs 2–3 mm); greater thickness means larger temperature gradients through the board cross-section and more thermal stress at the interface between the PCB and rigid components (BGA solder joints) during thermal cycling. Material T_g requirements are accordingly more stringent for high-layer-count boards
Copper coin integration: Some B200 designs embed copper coins (solid copper inserts in machined PCB cavities) beneath GPU packages to provide a low-thermal-resistance path from the package to the cold plate. The feasibility of copper coin integration depends on board thickness and the availability of adequate layer transitions around the coin area—design considerations that are more complex in 28+ layer boards than in thinner designs

Manufacturing Implications of High Layer Count HDI

Sequential Lamination

A 28-layer 2+N+2 HDI board requires a minimum of three lamination press cycles:

Core lamination: presses the 24 inner layers (12 pairs of core and prepreg) into the base board
First build-up: adds the L3 and L26 HDI layers on each side; laser drilling of L1–L3 and L26–L28 microvias follows
Second build-up: adds the L2 and L27 outer HDI layers; laser drilling of L1–L2 and L27–L28 microvias follows

Each press cycle adds lead time (typically 1–2 days per cycle, plus drilling and plating time), and each cycle introduces thermal stress on already-processed inner layers. Fabricators must qualify their press cycles for each material combination to ensure that the inner layer copper and laminate do not degrade during subsequent press cycles at elevated temperature and pressure.

Layer Registration

In a 28-layer board assembled through three press cycles, cumulative layer-to-layer misregistration is a critical quality control challenge. Registration accuracy of ± 50 μm per layer, accumulated across 28 layers and three lamination cycles, demands:

Pin registration tooling that precisely locates each layer relative to a common datum through all press cycles
Laser drilling of microvias referenced to optical fiducials on the panel surface rather than to mechanical tooling holes, to compensate for thermal expansion differences between lamination cycles
X-ray verification of microvia alignment after each laser drilling step; misaligned microvias (center offset > 50 μm from the target pad center) create high-resistance connections that can cause intermittent failures in service
Cross-section analysis of test coupons at multiple board locations per production lot to verify layer registration and dielectric thickness

Yield and Cost

High layer count and HDI complexity have a direct impact on yield and unit cost:

Each additional press cycle adds yield loss risk; a defect introduced in the core (an inner layer open or short missed by AOI) is unrecoverable after lamination and results in a scrapped panel
Laser drilling yield depends on the consistency of the dielectric surface exposed after each lamination cycle; contamination or thickness variation causes mis-drilled vias
Material cost for Megtron 7 is approximately 3–5× the cost of standard FR4 on a per-area basis; a 28-layer board using Megtron 7 on 8 signal layers and Megtron 6 on the remainder has a material cost significantly higher than an equivalent board in standard laminate
Prototype lead times for 30+ layer HDI boards with multiple lamination cycles, backdrilling, and via-in-pad processing are typically 25–40 business days; production lead times with volume pricing are 15–25 business days

The manufacturing complexity of these boards is one reason that AI server PCB production is concentrated among a small number of tier-1 PCB fabricators with the capital equipment, process expertise, and quality systems required.

Layer Count Comparison: Standard vs AI GPU PCBs

Parameter	Standard Server Board (12L)	H100 HGX Baseboard (22L)	B200 Baseboard (28L)	NVSwitch 4.0 Board (36L)
Total layers	12	22	28	36
Signal routing layers	6	8–10	10–12	14–18
Power plane layers	2	4–6	6–8	4–6
Ground/reference layers	4	8–10	10–12	14–16
HDI build-up layers per side	0	1–2	2–3	2–3 (or ELIC)
Lamination press cycles	1	2–3	3–4	4–5
Signal layer laminate	Standard FR4	Megtron 6E / Tachyon 100G	Megtron 7	Megtron 7 / HVLP copper
Minimum trace/space	100 μm / 100 μm	75 μm / 75 μm	75 μm / 75 μm	50–75 μm / 50–75 μm
Via-in-pad (VIPPO)	No	Yes	Yes	Yes
Backdrilling required	No	Yes (NVLink 4.0 + PCIe Gen5)	Yes (NVLink 5.0 + PCIe Gen6)	Yes / replaced by microvias
Typical fabrication lead time	5–10 days	15–20 days	20–30 days	30–40 days
Relative material cost	1×	~5–8×	~10–15×	~15–25×

FAQ

Is there a practical upper limit to how many layers an AI GPU PCB can have?
Yes, though it is set by manufacturing physics rather than an arbitrary rule. As layer count increases, board thickness increases, aspect ratio (board thickness / via diameter) rises, and it becomes progressively more difficult to achieve reliable copper plating in deep through-holes. At approximately 40+ layers with through-hole vias, aspect ratios exceed 20:1, and plating uniformity in the via barrel degrades below IPC Class 3 requirements. Any-layer HDI (ELIC) avoids this problem by eliminating full-board-thickness through-holes entirely, replacing them with stacked microvias. ELIC effectively removes the practical layer count ceiling imposed by through-hole plating constraints.

Do all GPU PCBs need 30+ layers, or just training boards?
Not all GPU PCBs need 30+ layers. Consumer GPU cards (RTX 4090) use 8–12 layers. Data center inference GPUs in PCIe add-in card format (L40S, A10G) use 12–16 layers. The 30+ layer count is specific to training-focused AI server baseboards that carry multiple GPU packages, NVSwitch chips, and route full NVLink 4.0 or 5.0 fabric connections. See AI Training vs AI Inference: Why They Need Different PCB Designs for a comparison of layer count requirements across deployment types.

What is the minimum layer count for an H100 SXM5 baseboard?
A functional H100 SXM5 baseboard with 8 GPUs and 4 NVSwitch 3.0 chips requires a minimum of approximately 18–20 layers to route all NVLink 4.0 connections, provide adequate power planes, and escape the BGA footprints of all packages. Designs at the low end of this range make aggressive routing compromises (shared routing channels, tighter spacing) that may compromise signal margins; 22–24 layers is the more practical minimum for a production-quality H100 baseboard that meets all signal integrity specifications with adequate margin.

Why can't AI GPU boards use fewer layers with denser routing per layer?
Two constraints prevent simply routing more traces per layer. First, the crosstalk specification between NVLink differential pairs limits how closely traces can be packed on a single layer; at 100–200 Gb/s per lane, near-end crosstalk (NEXT) must be < −30 dB, which requires a minimum edge-to-edge spacing of approximately 2× trace width between adjacent pairs. Increasing routing density beyond this spacing violates the crosstalk budget. Second, each signal layer must have an adjacent reference plane; eliminating reference planes to add more signal layers degrades impedance control and crosstalk simultaneously.

How does HDI reduce the required layer count compared to through-hole-only designs?
HDI does not necessarily reduce total layer count—it adds HDI build-up layers. However, it dramatically increases the routing density that can be achieved with a given number of layers, and it reduces the disruption to signal layers caused by through-hole via anti-pad clearances. Without HDI, routing from the inner rows of a fine-pitch GPU BGA would require so many through-hole vias that the anti-pad clearances in adjacent planes would effectively disrupt the reference plane beneath the critical NVLink routing—degrading impedance control across the entire BGA escape zone.

Which PCB fabricators can manufacture 30+ layer HDI boards for AI servers?
Production-capable 30+ layer HDI fabrication for AI server boards is concentrated among tier-1 fabricators with sequential lamination presses rated for low-loss specialty laminates (Megtron 7, Tachyon 100G), laser drill systems capable of 75 μm microvia diameter at the required stacking depth, controlled-depth backdrill CNC equipment, and fully automated layer registration verification. NextPCB is equipped to handle AI server board fabrication across this complexity range, from 16-layer H100 era boards through 32+ layer B200 and NVSwitch designs.

Need to Manufacture AI Server PCBs?

Whether you are fabricating a 20-layer H100 baseboard or pushing the limits of a 36-layer any-layer HDI NVSwitch board, NextPCB provides the sequential lamination capability, low-loss laminate processing, laser drilling, via-in-pad, backdrilling, and IPC Class 3 quality standards required for AI server PCB production.

Upload & Get Your Instant Quote Now Engineer Consultation

Related Articles:

About the Author

Julia Wu - Senior Sales Engineer at NextPCB.com

With over 10 years of experience in the PCB industry, Julia has developed a strong technical and sales expertise. As a technical sales professional, she specializes in understanding customer needs and delivering tailored PCB solutions that drive efficiency and innovation. Julia works closely with both engineering teams and clients to ensure high-quality product development and seamless communication, helping businesses navigate the complexities of PCB design and manufacturing. Julia is dedicated to offering exceptional service and building lasting relationships in the electronics sector, ensuring that each project exceeds customer expectations.

972 0 0 1 Facebook Twitter Linked In