Lolly Zheng- Sales Account Manager at NextPCB.com
Support Team
Feedback:
support@nextpcb.comThe race for AI infrastructure dominance has pushed semiconductor companies to continuously reinvent their accelerator architectures. While NVIDIA has historically led the charge, AMD's Instinct line has emerged as a formidable alternative. The AMD MI300X introduced a revolutionary chiplet-based architecture, pairing CDNA 3 cores with massive High Bandwidth Memory (HBM). Now, the upcoming MI350 series, built on the advanced CDNA 4 architecture, promises exponential leaps in AI inference and training capabilities.
For hardware engineers and PCB designers, transitioning from the MI300X to the MI350 is not simply a matter of swapping silicon. As computing power scales, so do the physical and electrical demands placed on the underlying printed circuit boards. The transition from one generation to the next dictates fundamental shifts in how we approach IC packaging, substrate design, and carrier board fabrication. Understanding the nuances of IC Substrate vs PCB manufacturing is critical, as the density of the MI350's advanced packaging directly affects the pinout and routing escape strategies on the main accelerator board.
The race for AI infrastructure dominance has pushed semiconductor companies to continuously reinvent their accelerator architectures. While NVIDIA has historically led the charge, AMD's Instinct line has emerged as a formidable alternative. The AMD MI300X introduced a revolutionary chiplet-based architecture, pairing CDNA 3 cores with massive High Bandwidth Memory (HBM). Now, the upcoming MI350 series, built on the advanced CDNA 4 architecture, promises exponential leaps in AI inference and training capabilities.
For hardware engineers and PCB designers, transitioning from the MI300X to the MI350 is not simply a matter of swapping silicon. As computing power scales, so do the physical and electrical demands placed on the underlying printed circuit boards. The transition from one generation to the next dictates fundamental shifts in how we approach IC packaging, substrate design, and carrier board fabrication. Understanding the nuances of IC Substrate vs PCB manufacturing is critical, as the density of the MI350's advanced packaging directly affects the pinout and routing escape strategies on the main accelerator board.
To understand the PCB design impact, we must first look at the raw specifications that differentiate these two generation-defining AMD AI GPUs. The increase in memory bandwidth, transistor count, and overall power consumption completely changes the constraints for the board layout.
| Specification | AMD Instinct MI300X | AMD Instinct MI350 (Expected) | PCB Design Impact |
|---|---|---|---|
| Architecture | CDNA 3 | CDNA 4 | Requires updated signal integrity modeling for higher frequency switching. |
| Memory | 192 GB HBM3 | Up to 288 GB HBM3E | Greater thermal density and stricter decoupling capacitor placement requirements. |
| Process Node | 5nm / 6nm (Chiplets) | 3nm / 4nm | Lower core voltage (Vdd) resulting in higher transient current spikes. |
| Interconnect | Infinity Fabric (PCIe Gen 5) | Advanced Infinity Fabric (PCIe Gen 6 Ready) | Demands ultra-low loss PCB materials and tighter impedance control. |
| TDP (Thermal Design Power) | 750W | Up to 1000W+ | Requires extreme Power Delivery Network (PDN) upgrades and liquid cooling. |
The sheer density of the AMD MI300X and MI350 modules forces PCB designers to abandon standard fabrication rules in favor of high-density interconnect (HDI) technologies. Accommodating thousands of BGA pins, high-current power planes, and high-speed differential pairs simultaneously is a monumental challenge.
To achieve this, engineers typically specify boards with layer counts far exceeding traditional consumer electronics. If you have ever wondered Why AI GPUs Require 30+ Layer HDI PCBs, the AMD Instinct series is a prime example. The stackup often consists of 24 to 32 layers, utilizing any-layer HDI structures, multiple sequential laminations, and microvias to allow signal escape from the dense central chiplet complex.
Material selection is equally critical. Standard FR-4 cannot handle the frequencies required by the MI350. Designers must specify ultra-low-loss laminates to prevent signal attenuation over long traces. Selecting the right High-Speed PCB Materials, such as Panasonic Megtron 7, Rogers, or Isola Tachyon, is mandatory to maintain the integrity of the high-speed Infinity Fabric interconnects.
While the MI300X leverages PCIe Gen 5 and mature Infinity Fabric protocols, the MI350 pushes the boundary toward PCIe Gen 6 and higher-speed chip-to-chip communications. This generational leap means that the margin for error in signal integrity (SI) shrinks to near zero.
When laying out the MI300X, adhering to PCIe Gen5 PCB Design Guidelines is sufficient. However, the MI350's underlying infrastructure demands readiness for the next generation. Engineers must familiarize themselves with PCIe Gen6 PCB Design Challenges, particularly the shift from NRZ (Non-Return-to-Zero) to PAM4 (Pulse Amplitude Modulation 4-level) signaling. PAM4 is highly sensitive to cross-talk, jitter, and insertion loss.
Furthermore, as networking speeds scale, board designers working on the Universal Baseboard (UBB) connecting these GPUs must implement strict 112G PAM4 PCB Design rules. This involves back-drilling to remove via stubs, utilizing teardrop pads, and routing traces at specific angles to prevent glass-weave skew (the fiber weave effect).
Perhaps the most daunting task when transitioning from the MI300X (750W) to the MI350 (up to 1000W+) is engineering the Power Delivery Network (PDN). According to Joule's law (P = I2R), as core voltages drop below 0.7V and power hits 1000W, the current required easily exceeds 1400 Amps.
Delivering 1400 Amps to a single silicon package requires a flawless PDN design. Engineers must utilize massive copper planes (often 2 oz or heavier on internal power layers) to minimize DC resistance (DCR). If the DCR is too high, power is lost as heat, severely impacting the efficiency of the AI server.
Voltage Regulator Modules (VRMs) must be placed as close to the GPU die as physically possible to minimize loop inductance. Furthermore, to handle the rapid transient load steps characteristic of AI training workloads (where the GPU can swing from idle to 100% utilization in nanoseconds), thousands of decoupling capacitors are placed on the reverse side of the PCB, directly beneath the GPU footprint.
Dissipating 1000W of heat from a concentrated area requires integration between the PCB layout and the mechanical cooling solution. Traditional air cooling is often insufficient for top-tier MI350 deployments, pushing the industry heavily toward liquid cooling.
To support this, the PCB must be designed to interface perfectly with cold plates. Proper Liquid Cooling Integration requires strict component height restrictions and flat keep-out zones on the PCB surface so the cold plate makes optimal contact with the GPU die and VRMs.
The PCB itself acts as a secondary heat sink. Designers utilize hundreds of thermal vias beneath high-power components like MOSFETs and inductors. The structural stackup of a typical AI module with a cold plate looks like this:
[ Liquid Cooling Cold Plate ] ============================= (Thermal Interface Material / TIM) [ AMD MI350 GPU & HBM3E Die ] ----------------------------- (Substrate) [ High-Density OAM PCB ] |-- Top Signal Layers |-- Heavy Copper Power Planes (Heat spreaders) |-- Thermal Vias |-- Bottom Signal Layers ----------------------------- [ High-Speed Mezzanine Connectors ]
Unlike NVIDIA's proprietary SXM modules, AMD heavily supports the Open Compute Project (OCP) standards. Both the MI300X and the MI350 are designed around the Open Accelerator Module (OAM) specification. Understanding What Is an OAM Module is essential for hardware engineers building AI clusters, as it dictates the mechanical dimensions, mounting holes, and high-speed connector placements.
Manufacturing and assembling an OAM module of this complexity is not for standard board houses. The immense size of the BGA package, combined with the heavy copper planes, makes PCB assembly (PCBA) highly sensitive to thermal warping during the reflow soldering process. Engineers should consult detailed OAM PCB Assembly Guides to understand the rigorous profiling, 3D X-ray inspection, and specialized jigs required to ensure zero defect soldering on these high-value boards.
Ultimately, these OAM modules are installed onto an overarching Universal Baseboard (UBB), forming the core of the AI cluster. If you are involved in full-system design, reviewing GPU Rack Architecture principles will help you understand how multiple AMD GPUs communicate across the baseboard and out to the network switches.
Q: Why does the AMD MI350 require more PCB layers than the MI300X?
A: The MI350 features a higher pin density due to increased memory bandwidth (HBM3E), faster PCIe/Infinity Fabric lanes, and significantly more complex power delivery routing. This forces designers to add more signal and ground plane layers to successfully escape all signals while maintaining impedance control.
Q: Can I use FR-4 material for an AMD MI300X baseboard?
A: No. Standard FR-4 has high dielectric loss (Df) which will severely degrade the high-speed PCIe Gen 5/6 and Infinity Fabric signals. Ultra-low-loss materials like Megtron 7 or advanced Rogers laminates are required.
Q: How do PCB manufacturers test AI accelerator boards?
A: Manufacturers use specialized Flying Probe or Bed-of-Nails fixtures for electrical testing. For assembly, 3D Automated Optical Inspection (AOI) and advanced X-ray imaging are mandatory to inspect the hidden solder joints beneath the massive GPU BGA package.
The transition from the AMD MI300X to the MI350 represents a massive leap in AI computing power, but it brings equally massive challenges in PCB engineering. From routing PAM4 signals on 30-layer HDI boards to designing Power Delivery Networks capable of handling over 1000W, every aspect of the hardware must be meticulously crafted.
Whether you are working with the OAM form factor, designing the Universal Baseboard, or tackling liquid cooling integration, partnering with a fabricator who understands the extreme tolerances of AI hardware is non-negotiable.
Need to manufacture AI server PCBs? Get a quote from NextPCB →
Still, need help? Contact Us: support@nextpcb.com
Need a PCB or PCBA quote? Quote now