Contact Us
Blog / A Hardware Overview from PCB to Full Rack

A Hardware Overview from PCB to Full Rack

Posted: June, 2026 Last Updated: June, 2026 Writer: Arya Li Share: NEXTPCB Official youtube NEXTPCB Official Facefook NEXTPCB Official Twitter NEXTPCB Official Instagram NEXTPCB Official Linkedin NEXTPCB Official Tiktok NEXTPCB Official Bksy

Abstract: The era of large language models (LLMs) and generative AI has transformed the modern data center. Training models with hundreds of billions of parameters is no longer a task for a single server; it requires an interconnected armada of compute engines. Building an AI GPU cluster involves navigating complex physical and electrical engineering challenges, bridging the gap between nanometer-scale silicon and megawatt-scale facility infrastructure. From the ultra-dense High-Density Interconnect (HDI) PCBs that host the accelerators to the massive fiber-optic fabrics that tie thousands of nodes together, this guide provides a comprehensive overview of AI datacenter hardware. We will explore the critical components, networking topologies, thermal management solutions, and the fundamental PCB manufacturing capabilities required to bring a modern GPU cluster to life.


  1. Table of Contents


The Anatomy of an AI GPU Cluster

An AI GPU cluster is essentially a supercomputer purpose-built for parallel processing. Unlike traditional cloud computing environments which are optimized for virtualization and isolated microservices, a GPU cluster operates as a single, cohesive brain during AI training workloads. The hardware architecture is defined by its hierarchy, scaling from the individual printed circuit board up to the warehouse-scale deployment.

To understand the hardware ecosystem, we must look at the logical physical groupings:

  • The Chip: The individual GPU or AI accelerator (e.g., NVIDIA H100, AMD MI300X, or custom ASICs).
  • The Baseboard (UBB/OAM): A highly complex, multi-layer PCB housing multiple GPUs (typically 4 or 8) and their immediate scale-up switches.
  • The Node (AI Server): A single chassis (usually 4U to 8U) containing the GPU baseboard, the host CPU motherboard, system memory, local NVMe storage, and network interface cards (NICs).
  • The Rack: A standard 19-inch or 21-inch cabinet housing multiple AI nodes, top-of-rack (ToR) switches, power shelf units, and cooling manifolds.
  • The Pod (or SuperPod): A standardized cluster block consisting of multiple racks (e.g., 32 racks) connected via a non-blocking network fabric.
  • The Cluster: Multiple pods connected via a core spine network, scaling up to tens of thousands of GPUs.

Below is a simplified structural diagram of a standard Clos (Fat-Tree) AI cluster network topology, illustrating how individual racks scale out:


=====================================================================
                    AI GPU CLUSTER ARCHITECTURE
=====================================================================

       [ Core Spine Switches (Director Class 400G/800G) ]
         /              |               |               \
        /               |               |                \
[ Leaf Switch 1 ] [ Leaf Switch 2 ] [ Leaf Switch 3 ] [ Leaf Switch N ]
      |                 |               |                |
  (Optics)          (Optics)        (Optics)         (Optics)
      |                 |               |                |
[   RACK 1    ]   [   RACK 2    ]   [   RACK 3    ]  [   RACK N    ]
+-------------+   +-------------+   +-------------+  +-------------+
| ToR Switch  |   | ToR Switch  |   | ToR Switch  |  | ToR Switch  |
|-------------|   |-------------|   |-------------|  |-------------|
| GPU Node A  |   | GPU Node A  |   | GPU Node A  |  | GPU Node A  |
| GPU Node B  |   | GPU Node B  |   | GPU Node B  |  | GPU Node B  |
| GPU Node C  |   | GPU Node C  |   | GPU Node C  |  | GPU Node C  |
| GPU Node D  |   | GPU Node D  |   | GPU Node D  |  | GPU Node D  |
| Power Shelf |   | Power Shelf |   | Power Shelf |  | Power Shelf |
+-------------+   +-------------+   +-------------+  +-------------+

At the Core: The AI Server Node and Its PCBs

The fundamental building block of the cluster is the AI server. A modern AI server is a marvel of electronic design, heavily reliant on advanced PCB manufacturing techniques.

At the heart of the system is the GPU Baseboard (Universal Baseboard or UBB). In NVIDIA architectures, this is often the HGX board. In open-standard environments, this utilizes the OAM (Open Accelerator Module) standard. These baseboards are massive—often exceeding 18 inches in dimension—and feature extreme layer counts (24 to 40+ layers).

The baseboard must route tens of thousands of high-speed signals between the GPUs. For instance, to facilitate memory sharing and rapid data transfer, the traces must maintain strict impedance control with minimal insertion loss. The materials used are invariably Ultra-Low Loss (ULL) laminates like Rogers, Megtron 7, or Megtron 8.

Above the baseboard sits the CPU Motherboard. While the GPUs handle the massive parallel matrix multiplications, the CPUs (x86 or ARM-based) handle OS operations, storage orchestration, and network dispatching. The connection between the CPU motherboard and the GPU baseboard relies on high-speed PCIe Gen 5 (and soon Gen 6) connectors, requiring their own rigorous signal integrity analysis.

Scale-Up vs. Scale-Out: Interconnecting the Cluster

A single server containing 8 GPUs is powerful, but LLMs require thousands. This introduces the concept of scale-up versus scale-out interconnects, both of which place immense pressure on PCB routing and connector design.

Scale-Up (Within the Server)

Scale-up refers to the high-bandwidth, low-latency connections between GPUs residing in the same server node. Because AI training requires constant sharing of gradients and weights, going through the CPU's PCIe bus is a massive bottleneck.

To solve this, chipmakers use proprietary interconnects. NVIDIA uses NVLink and NVSwitch technology. AMD utilizes Infinity Fabric. These scale-up networks route primarily through the physical layers of the GPU baseboard. Designing a PCB capable of handling hundreds of NVLink differential pairs operating at 112G PAM4 (Pulse Amplitude Modulation 4-level) over FR4/ULL materials is one of the hardest challenges in modern hardware engineering. It requires advanced back-drilling, virtually zero stub lengths, and meticulous fiber-weave skew mitigation.

Scale-Out (Between Servers and Racks)

Once you exceed the limits of a single node (typically 8 GPUs), you must "scale out" across the data center. This involves sending data through Network Interface Cards (NICs)—such as NVIDIA ConnectX or AMD Pensando—up to Top-of-Rack switches, and then to Spine switches.

The dominant protocols for scale-out are InfiniBand (favored for its guaranteed lossless nature and ultra-low microsecond latency) and RoCE v2 (RDMA over Converged Ethernet), which leverages traditional Ethernet hardware optimized for AI workloads. The transition from the server to the switch is handled by transceiver modules (OSFP or QSFP-DD) and active optical cables (AOCs).

Power and Thermal Constraints in High-Density Racks

Building an AI cluster is as much a facilities engineering problem as it is a compute problem. A traditional cloud server rack might draw 10kW to 15kW of power. A modern AI GPU rack draws upwards of 40kW to 120kW.

This immense power draw necessitates specialized Power Delivery Networks (PDN). Traditional 12V distribution is highly inefficient at these amperages due to I2R copper losses. Modern AI clusters distribute 48V (or even 54V) DC via massive copper busbars running down the back of the rack directly to the server nodes. The server's internal PCBs then use complex multiphase Voltage Regulator Modules (VRMs) to step the 48V down to the ~0.8V required by the GPU core, delivering thousands of amps in milliseconds.

With massive power comes massive heat. Air cooling is rapidly reaching its physical limits. Liquid cooling integration is becoming mandatory. This involves Direct-to-Chip (D2C) cold plates mounted directly on the GPU packages, with coolant loops (manifolds) routing liquid into and out of the server chassis. PCB designers must now account for the physical weight of liquid cold plates, ensuring the PCB substrate does not warp, which requires stiffeners and high-Tg materials.

Fabric and Network Infrastructure

The network switches that tie the cluster together are themselves incredible feats of hardware. An 800G or 1.6T Ethernet/InfiniBand switch relies on a massive switch ASIC (like Broadcom Tomahawk or NVIDIA Spectrum).

The PCBs inside these switches must route signals from the central ASIC to the front-panel optical ports. As port speeds push toward 800Gbps, signal loss over copper PCB traces becomes so severe that designers must use "flyover" cables (twinax cables that bypass the PCB entirely) or transition to Co-Packaged Optics (CPO), where the optical engine is brought directly onto the IC substrate next to the switch chip. For the traces that remain on the board, HDI PCB techniques with ultra-smooth copper foils are non-negotiable.

Hardware Comparison: Small vs. Large Scale AI Clusters

Not every enterprise needs a 10,000-GPU supercomputer. The hardware architecture shifts significantly depending on the scale of the deployment. Below is a comparison between a small enterprise cluster and a hyperscale cluster.

Specification Metric Small Enterprise Cluster (Scale-Up Focus) Hyperscale AI SuperPod (Scale-Out Focus)
GPU Count 16 to 64 GPUs (2 to 8 Nodes) 1,024 to 32,000+ GPUs
Primary Use Case Fine-tuning LLMs, inference, localized R&D Pre-training foundation models (GPT-4 class)
Network Topology Single-tier (Leaf only) Multi-tier Non-blocking Fat-Tree (Leaf/Spine/Core)
Scale-Out Fabric 400G Ethernet (RoCE) 800G InfiniBand or custom AI Ethernet
Rack Power Density ~30kW - 40kW per Rack 80kW - 120kW+ per Rack
Cooling Solution High-velocity forced air or rear-door heat exchangers Direct-to-Chip liquid cooling and immersion
Storage Architecture Local NVMe + Basic NAS Distributed Parallel File Systems (e.g., GPFS, Lustre)

Challenges in Manufacturing AI Datacenter Hardware

The blueprint for an AI GPU cluster is only as good as the manufacturing processes that physically build it. The boards underpinning this hardware push the absolute boundaries of current fabrication technology.

When you manufacture a server motherboard or an AI accelerator card, the fabricator must execute perfectly across several demanding vectors:

  • Extreme Layer Counts: 30+ layers are standard. Pressing this many layers of disparate materials without causing delamination or registration errors requires world-class lamination presses and precise temperature profiling.
  • HDI and Via-in-Pad Plated Over (VIPPO): Due to the massive BGA pin counts (often exceeding 8,000 pins on a single ASIC package), advanced PCB capabilities involving multiple lamination cycles, laser-drilled blind/buried vias, and VIPPO are required to route signals out from beneath the chip.
  • Strict Back-drilling: At PAM4 signaling rates (56G/112G), any unused portion of a plated through-hole via (a "stub") acts as an antenna, destroying signal integrity via reflections. Depth-controlled back-drilling is mandatory, with tolerances as tight as ± 4 mils.
  • Impedance Control: AI clusters cannot tolerate dropped packets. Impedance variations across the board must be held to within ± 5% (often ± 8% on standard boards), requiring specialized etching lines and real-time TDR (Time Domain Reflectometry) testing during fabrication.

Frequently Asked Questions (FAQ)

1. What is the difference between a traditional cloud cluster and an AI GPU cluster?

A traditional cloud cluster is designed for high availability and multi-tenancy; workloads are independent, and if a node fails, the user barely notices. An AI cluster runs highly synchronous workloads. If one GPU in a 1,000-GPU training run fails or stalls due to network latency, the entire training process halts. Therefore, AI clusters require vastly superior "east-west" networking (node-to-node) and specialized interconnects.

2. Why do AI clusters use 48V power instead of 12V?

Power (Watts) equals Voltage (V) × Current (I). A single AI server might draw 10,000W. At 12V, that requires 833 Amps of current. Pushing 833 Amps through copper cables and PCB traces generates massive heat (I2R loss) and requires incredibly thick cables. By moving to 48V, the current drops to ~208 Amps, drastically reducing power loss and physical cable bulk inside the rack.

3. Can I build an AI cluster using standard Ethernet?

Yes. While InfiniBand has historically dominated AI due to its built-in lossless characteristics, modern Ethernet paired with RDMA over Converged Ethernet (RoCE v2) has matured significantly. Technologies like the Ultra Ethernet Consortium (UEC) are actively optimizing Ethernet specifically for AI workloads to compete directly with InfiniBand.

4. What role does the PCB substrate play in signal integrity?

As frequencies increase, the resin and glass weave of a standard FR4 PCB absorb signal energy (dielectric loss). For AI clusters, PCBs must use Ultra-Low Loss materials that allow high-frequency signals to travel long distances (up to 20 inches) across the board without degrading beyond the receiver's ability to decode the data.


Ready to Scale Your AI Hardware?

Building the infrastructure for the next generation of artificial intelligence requires manufacturing partners who understand the bleeding edge of PCB fabrication. From 40-layer ULL baseboards to impedance-controlled back-drilled server motherboards, the physical foundation of your cluster must be flawless.

Need to manufacture AI server PCBs? 

Author Name

About the Author

Arya Li, Project Manager at NextPCB.com

With extensive experience in manufacturing and international client management, Arya has guided factory visits for over 200 overseas clients, providing bilingual (English & Chinese) presentations on production processes, quality control systems, and advanced manufacturing capabilities. Her deep understanding of both the factory side and client requirements allows her to deliver professional, reliable PCB solutions efficiently. Detail-oriented and service-driven, Arya is committed to being a trusted partner for clients and showcasing the strength and expertise of the factory in the global PCB and PCBA market.