Contact Us
Blog / PCIe Gen6 PCB Design Challenges: PAM4, FEC and What Changes for Layout Engineers

PCIe Gen6 PCB Design Challenges: PAM4, FEC and What Changes for Layout Engineers

Posted: June, 2026 Last Updated: June, 2026 Writer: Lolly Zheng Share: NEXTPCB Official youtube NEXTPCB Official Facefook NEXTPCB Official Twitter NEXTPCB Official Instagram NEXTPCB Official Linkedin NEXTPCB Official Tiktok NEXTPCB Official Bksy

Introduction

PCIe Gen6 (PCIe 6.0) doubles the per-lane throughput of Gen5 from 32 GT/s to 64 GT/s, delivering approximately 256 GB/s across a ×16 link—twice the bandwidth of Gen5. On paper, this looks like a straightforward generational improvement. In practice, Gen6 achieves the doubling not by increasing the signal frequency (which remains 32 GHz Nyquist, the same as Gen5) but by switching from two-level NRZ (Non-Return-to-Zero) signaling to four-level PAM4 (Pulse Amplitude Modulation). That change has profound consequences for PCB design that go far beyond simply tightening the existing Gen5 rules.

PAM4 encodes two bits per symbol by using four distinct voltage levels. At the same symbol rate as NRZ, PAM4 carries twice the data. But the spacing between voltage levels in PAM4 is one-third the spacing in NRZ at equal signal amplitude. This three-fold reduction in voltage margin means that noise, reflections, crosstalk, and jitter sources that were manageable in Gen5 NRZ become channel-killing problems in Gen6 PAM4—unless every element of the PCB design is tightened accordingly.

PCIe Gen6 is implemented in NVIDIA's B200 GPU (SXM6 form factor) as the host CPU interface, and is expected to become standard across data center AI hardware through 2026 and beyond. Layout engineers who have mastered Gen5 design will find that Gen6 requires a step change in rigor—not just incremental improvements.

  1. Table of Contents

PCIe Gen6 Fundamentals: Speed, PAM4, and the Encoding Change

NRZ vs PAM4: Why the Modulation Changed

Every PCIe generation from Gen1 through Gen5 used NRZ signaling: each symbol period carries one bit, encoded as either a high voltage level (logic 1) or a low voltage level (logic 0). Doubling data rate with NRZ requires doubling the symbol rate, which doubles the Nyquist frequency and doubles the frequency at which the channel must maintain adequate signal quality. By Gen5 (32 GT/s, 16 GHz Nyquist), the PCB channel loss budget was already fully utilized with premium low-loss laminates.

Doubling again to 64 GT/s with NRZ would require 32 GHz Nyquist performance from the PCB channel. At 32 GHz, dielectric and conductor losses are so severe that no commercially available laminate can sustain a channel of practical length (50–100 mm) within a usable loss budget. PCIe Gen6 instead uses PAM4, which keeps the symbol rate at 32 GT/s (16 GHz Nyquist—the same as Gen5) but encodes two bits per symbol by using four amplitude levels: 0, 1, 2, and 3.

The trade-off: PAM4 at the same amplitude swing as NRZ has three times smaller voltage spacing between adjacent levels (the eye height for each of the three eyes in a PAM4 eye diagram is one-third the NRZ eye height). Any noise, jitter, or distortion that would have been comfortably within the NRZ eye opening may now cause bit errors in PAM4.

The PCB design consequence is not a harder version of the same problem—it is a qualitatively different problem where every source of signal degradation (dielectric loss, conductor loss, via reflections, reference plane disruption, crosstalk) must be controlled to a standard that leaves the PAM4 eye open after all impairments are summed.

PCIe Gen6 Key Specifications

Parameter PCIe Gen5 PCIe Gen6
Line rate (GT/s) 32 64
Signaling NRZ (2-level) PAM4 (4-level)
Symbol rate 32 GBaud 32 GBaud
Nyquist frequency 16 GHz 16 GHz
Bits per symbol 1 2
Effective bandwidth per lane ~4 GB/s ~8 GB/s
×16 link bandwidth (bidirectional) ~128 GB/s ~256 GB/s
Encoding 1b/1b (effective) 242b/256b FLIT
FLIT mode No (optional) Yes (mandatory)
FEC Mandatory (CRC-based) Mandatory (stronger; FBER-based)
Differential impedance 85 Ω ± 10% 85 Ω ± 10%
Max channel insertion loss < 28 dB at 16 GHz < 28 dB at 16 GHz (same limit, harder to meet with PAM4 margins)
Voltage margin per eye Full swing ⅓ of NRZ equivalent swing per eye level

The identical Nyquist frequency between Gen5 and Gen6 (both 16 GHz) is a key point: the PCB channel insertion loss specification is nominally the same (< 28 dB at 16 GHz). However, the PAM4 system has one-third the voltage margin of an equivalent NRZ system. This means that channel impairments (reflections, crosstalk, jitter) that consumed say 10% of the Gen5 NRZ eye budget will consume 30% of the Gen6 PAM4 eye budget at the same absolute magnitude. In practice, Gen6 PCB channels must achieve significantly lower insertion loss, return loss, and crosstalk than Gen5 channels to leave adequate PAM4 eye opening after equalization.


Forward Error Correction in PCIe Gen6

FEC Types: CRC and FBER

PCIe Gen6 introduces FLIT (Flow Control Unit) mode as mandatory. A FLIT is a 256-byte packet that includes a CRC for error detection. The Gen6 FEC scheme adds redundancy within each FLIT to enable correction of burst errors that CRC alone would only detect. Two FEC modes are defined in the Gen6 specification:

  • FBER (Forward Bit Error Rate) mode: The primary Gen6 FEC mechanism; targets a post-FEC BER < 10−15; requires pre-FEC BER < 10−3 to achieve the post-FEC target; provides sufficient correction capability for PAM4 channels with moderate pre-FEC error rates
  • CRC-only mode: Error detection without correction; used for very short on-board channels where the pre-FEC BER is expected to be very low (< 10−8); lower latency than full FEC mode

The pre-FEC BER requirement (< 10−3) is dramatically more relaxed than Gen5's pre-FEC channel quality expectation, reflecting the reality that PAM4 channels inherently have higher raw error rates than NRZ channels at the same channel loss. Gen6 FEC is designed to clean up a PAM4 channel that looks poor in raw BER terms but whose errors are correctable with reasonable overhead.

FEC Latency Impact

Gen6 FEC adds more latency than Gen5 FEC because of the larger FLIT size and stronger correction capability:

  • PCIe Gen5 FEC latency: approximately 2–4 ns per direction
  • PCIe Gen6 FEC latency: approximately 4–8 ns per direction (varies by implementation)

For AI server workloads where PCIe is used primarily for model loading, checkpoint writing, and non-latency-critical host-to-GPU data transfers, this additional FEC latency is not a performance concern. For latency-sensitive inference applications where PCIe round-trip time directly affects time-to-first-token, the Gen6 FEC latency contribution should be included in system latency budgets.

What FEC Cannot Fix

FEC is designed to correct random and burst errors up to the pre-FEC BER limit. It does not fix:

  • Link training failures: If the channel loss is so high that the receiver's CDR (Clock and Data Recovery) circuit cannot lock to the transmitted signal during link training, the PCIe link will not train at Gen6 speed regardless of FEC capability
  • Systematic deterministic jitter: Periodic jitter caused by power supply noise coupling, crosstalk from aggressor signals at specific frequencies, or resonances in the via structure creates deterministic eye closure that FEC cannot correct because the errors are correlated, not random
  • Sustained BER above the FEC input limit: If pre-FEC BER consistently exceeds 10−3, the FEC correction capacity is overwhelmed and post-FEC BER rises rapidly; the channel effectively fails

These limitations underscore why FEC is not a substitute for careful PCB design—it is a safety net for residual random errors after a well-designed channel has already minimized systematic impairments.


Channel Loss Budget: Tighter Than Gen5

Insertion Loss Limit

The PCIe Gen6 specification nominally maintains the same 28 dB insertion loss limit at 16 GHz as Gen5. However, this parity is deceptive. The PAM4 system requires that the channel deliver adequate signal quality for the receiver's equalizer to separate four voltage levels, not two. The effective channel insertion loss that leaves a usable PAM4 eye after equalization is substantially lower than the specification limit:

  • A Gen5 NRZ channel with 26 dB insertion loss at 16 GHz has adequate eye opening for reliable link operation with modern equalizers
  • A Gen6 PAM4 channel with 26 dB insertion loss at 16 GHz has a much smaller eye opening per level (one-third the NRZ equivalent) and may be borderline or unreliable depending on the equalizer implementation and the distribution of loss across the frequency band
  • In practice, Gen6 PCB channels are designed to achieve 20–24 dB or less at 16 GHz, leaving additional margin to keep the PAM4 eye open after all impairments are accounted for

Return Loss and Mode Conversion

PAM4 channels are more sensitive to return loss (impedance discontinuities) than NRZ channels because reflections cause inter-symbol interference (ISI) that affects all four voltage levels simultaneously, creating pattern-dependent eye closure that is difficult for equalizers to fully compensate. Gen6 return loss requirements:

  • Return loss < −9 dB at frequencies from 50 MHz to 16 GHz (same as Gen5 in dB, but more stringent in effect because PAM4 has less margin to absorb reflections)
  • Differential-to-common mode conversion (Sdc21) < −35 dB; mode conversion is particularly damaging to PAM4 because the converted common mode signal appears as a noise floor that affects all eye levels equally
  • Via structures, connector transitions, and impedance neck-downs at BGA launches are the primary return loss contributors; all of these must be designed more carefully for Gen6 than for Gen5

Loss Budget Allocation

Loss Contributor Gen5 Typical Gen6 Target (practical) Notes
PCB trace (dielectric) 10–18 dB 8–14 dB Shorter channels or better material required
PCB trace (conductor / skin) 3–8 dB 2–5 dB HVLP copper foil strongly recommended
Via structures 1–4 dB 0.5–2 dB Tighter stub control; optimized anti-pad geometry
Connector loss 0.5–3 dB 0.5–2 dB Gen6-rated connectors only; verify S-params at 16 GHz
Package launch 1–2 dB 0.5–1.5 dB Optimized pad geometry; use vendor S-parameter model
Practical total target < 28 dB (spec limit) < 20–24 dB (design target for PAM4 margin) Leave ≥ 4–8 dB margin vs spec limit for PAM4 eye adequacy

Differential Impedance: 85 Ω with Tighter Tolerance

The Gen6 differential impedance target remains 85 Ω, the same as Gen5. The specified tolerance is also ± 10%. However, the effective tolerance that keeps the PAM4 channel within margin is tighter in practice:

  • A ± 10% impedance excursion (75.5–93.5 Ω) creates a reflection coefficient of approximately −20 dB in NRZ; in Gen5, this is acceptable with adequate margin remaining
  • In Gen6 PAM4, the same −20 dB reflection contributes ISI noise proportionally to all four signal levels; because the level spacing is one-third of NRZ, the relative impact on eye margin is approximately three times larger
  • Best practice for Gen6 is to design for ± 5% impedance variation (80.75–89.25 Ω), consistent with the tighter manufacturing tolerances used in NVLink 4.0 and NVLink 5.0 boards (see What Is NVLink? How NVIDIA's High-Speed GPU Interconnect Shapes PCB Routing)
  • Fabricators must be capable of ± 5% impedance control at 85 Ω; this requires LDI (Laser Direct Imaging) for trace patterning, tight prepreg thickness control (± 3%), and TDR verification on every production panel

Laminate Selection: Ultra-Low-Loss Required

Dielectric Loss at 16 GHz (PAM4 Nyquist)

The Gen6 Nyquist frequency (16 GHz) is identical to Gen5. However, because Gen6 requires a smaller practical channel loss budget (20–24 dB design target vs < 28 dB spec limit) to maintain PAM4 eye margin, the effective laminate quality requirement is stricter:

  • Megtron 6 (Df ~0.004): Borderline for Gen6 channels longer than 100 mm; acceptable for very short (< 80 mm) on-board Gen6 channels with minimal via transitions; not recommended as the primary Gen6 signal laminate
  • Megtron 6E / Tachyon 100G (Df ~0.002–0.003): Minimum recommended for Gen6 signal layers; suitable for channels up to approximately 150–180 mm with careful via design
  • Megtron 7 (Df ~0.002): Best practice for Gen6; provides adequate margin for channels up to 200 mm; used on B200 boards for both NVLink 5.0 and PCIe Gen6 layers
  • Rogers ultra-low-loss grades (RO4350B, RO3003): Excellent loss performance; higher cost and more challenging to process in sequential lamination compared to Megtron 7; used in specific high-performance applications

Copper Foil: HVLP Becomes Best Practice

Conductor loss at 16 GHz is significant for both Gen5 and Gen6, but the reduced PAM4 margin in Gen6 means that every dB saved on conductor loss directly translates to PAM4 eye margin:

  • Standard ED copper (Rz ~6–10 μm): Not suitable for Gen6 signal layers
  • Low-profile (LP) copper (Rz ~2–4 μm): Acceptable for Gen5; marginal for Gen6; use only on Gen6 channels with tight budgets if better foil is unavailable
  • Very-low-profile (VLP) copper (Rz ~1–2 μm): Recommended minimum for Gen6 signal layers; the standard for AI server boards with NVLink 4.0 and PCIe Gen5
  • High-VLP (HVLP) copper (Rz < 1 μm): Best practice for Gen6 and NVLink 5.0 co-routed boards; reduces skin-effect conductor loss by 0.5–1 dB on a 150 mm trace compared to VLP; mandatory on NVLink 5.0 layers in B200 designs (see NVIDIA Blackwell Architecture Explained)

Material Comparison Table

Laminate Df (10 GHz) Copper Foil Gen6 Suitability Max Practical Channel Length (Gen6)
Standard FR4 ~0.020 Standard ED Not suitable < 30 mm (unusable)
Panasonic Megtron 6 ~0.004 LP Marginal; very short channels < 80 mm with optimized vias
Isola Tachyon 100G ~0.0021 VLP Good Up to 150–180 mm
Panasonic Megtron 6E ~0.0024 VLP Good Up to 150–170 mm
Panasonic Megtron 7 ~0.0020 HVLP Best practice Up to 200 mm; used in B200 boards
Rogers RO4350B ~0.0037 VLP Conditional (check budget) Up to 120 mm with careful via design

Via Design for PCIe Gen6

Stub Resonance: Even Stricter Than Gen5

Via stub resonance creates insertion loss notches in the channel frequency response. For Gen5 NRZ, a notch at or above the Nyquist frequency (16 GHz) was acceptable if the notch depth was within the insertion loss budget. For Gen6 PAM4, even a partial notch whose skirt increases loss at 12–16 GHz by 2–3 dB can be sufficient to close the PAM4 eye.

The required minimum stub resonance frequency for Gen6 is higher than for Gen5 to keep the notch skirt effects below 12 GHz:

  • For Gen5: stub length < 40 mils (< 1 mm) to push resonance above 40 GHz; impact on Gen5 channel < 0.5 dB
  • For Gen6: stub length < 20 mils (< 0.5 mm) to push resonance above 80 GHz; even smaller stubs (< 10 mils) are preferred to keep skirt effects below 1 GHz

For Gen6 PCB channels, through-hole vias with residual stubs after backdrilling are acceptable only if the stub length is < 0.5 mm (approximately 20 mils). Achieving this with standard ± 50 μm backdrill depth accuracy requires very careful stackup design to ensure that the signal layer is sufficiently close to the drilled-from face to leave an acceptable stub.

For the most loss-sensitive Gen6 channels (maximum length, multiple layer transitions), laser-drilled blind/buried microvias that eliminate stubs entirely are the preferred solution. This is consistent with the HDI technology used on B200 boards for both NVLink 5.0 and PCIe Gen6 routing. For HDI design details, see Why AI GPUs Require 30+ Layer HDI PCBs.

Backdrilling Requirements

  • Target stub length: < 10 mils (250 μm); ideally < 5 mils (125 μm) for maximum PAM4 margin
  • Backdrill depth accuracy: ± 25–50 μm (tighter than the ± 50 μm acceptable for Gen5); requires per-panel depth file generated from actual as-built stackup measurements
  • Backdrill diameter: Original via drill + 0.3–0.4 mm; larger than the Gen5 practice to ensure complete stub removal including any residual plating at the barrel wall junction
  • Verification: TDR measurement on representative board locations (not just edge coupons) to verify stub length; VNA insertion loss measurement on channel coupons to verify that stub contribution is within budget

Via Pad and Anti-Pad Geometry

For Gen6, via pad and anti-pad geometry optimization follows the same principles as Gen5 but with tighter targets:

  • Differential via pair spacing: Center-to-center spacing 0.6–0.9 mm (tighter than Gen5's 0.8–1.2 mm) to maintain differential coupling through the via transition and reduce common-mode conversion; the return loss from mode conversion (differential to common mode) is more damaging in PAM4 than in NRZ
  • Anti-pad shape: Merged oval anti-pad for differential via pairs, sized to minimize shunt capacitance while maintaining adequate reference plane coverage around the via; Gen6 optimal anti-pad diameter is typically 0.1–0.2 mm smaller than Gen5 practice to reduce the reference plane void area
  • Ground stitching vias: Four ground stitching vias surrounding each signal via pair, placed at 0.8–1.2 mm from signal via center; critical for both return loss and mode conversion control in Gen6
  • Via-in-pad preference: For Gen6 channels in AI server boards, via-in-pad with laser-drilled microvias is strongly preferred over through-hole vias with backdrilling wherever the HDI stackup permits; eliminates stub entirely and provides cleaner via geometry with lower return loss contribution

Equalization: Tx and Rx Side Requirements

Transmitter Equalization (TXEQ)

PCIe Gen6 transmitters use Feed-Forward Equalization (FFE) with multiple pre-cursor and post-cursor taps to pre-distort the transmitted signal so that it arrives at the receiver with a more open eye after the channel's frequency-dependent attenuation. Gen6 TXEQ requirements:

  • Minimum 3 taps (c−1, c0, c+1): pre-cursor, main cursor, post-cursor
  • Optional extended taps for additional ISI compensation on long or complex channels
  • TXEQ settings are negotiated during link training via LTSSM (Link Training and Status State Machine); each setting corresponds to a different pre-distortion profile that the PCIe Link Equalization protocol evaluates for that specific channel

The important PCB design implication: TXEQ can compensate for smooth, frequency-monotonic channel attenuation (the type caused by dielectric loss and conductor loss), but it is less effective at compensating for notches in the insertion loss response caused by via stub resonances or abrupt impedance discontinuities. A channel with a smooth 22 dB insertion loss may be more trainable than a channel with 18 dB average loss but a 3 dB notch at 14 GHz.

Receiver Equalization: CTLE, DFE, and Slicer

Gen6 receivers combine multiple equalization stages:

  • CTLE (Continuous Time Linear Equalizer): An analog filter in the receiver front end that boosts high-frequency content to compensate for channel roll-off; typically provides 10–20 dB of boost at 16 GHz; the first stage of equalization, applied before analog-to-digital conversion
  • DFE (Decision Feedback Equalizer): A digital equalizer that uses previously decoded symbols to remove ISI caused by post-cursor reflections and multipath in the channel; effective for removing deterministic post-cursor ISI that CTLE does not address
  • PAM4 slicer: The decision circuit that maps the equalized signal to one of four voltage levels (0, 1, 2, 3); slicer thresholds are adaptively adjusted during link operation to center them between the received eye levels; the slicer's ability to adapt is limited by how much noise and ISI remain after CTLE and DFE

How Equalization Capability Interacts with PCB Design

Equalization is more capable on channels with smooth frequency-dependent loss than on channels with resonances or mode conversion. PCB design decisions that affect equalizability:

  • Smooth insertion loss vs. notched: A channel with Megtron 7 laminate (smooth, frequency-monotonic loss) and well-designed vias (no significant stub resonances) is easily equalized by CTLE; a channel with stub resonances creating notches at 12–16 GHz requires the equalizer to compensate for both frequency roll-off and notch distortion simultaneously, often exceeding equalization capability
  • Mode conversion: Differential-to-common mode conversion appears as an effectively uncorrectable noise floor at the PAM4 slicer; DFE and CTLE operate on the differential signal and cannot cancel common mode noise; reference plane disruptions, asymmetric via structures, and unmatched differential pair geometries are the primary PCB sources of mode conversion
  • Crosstalk: Near-end and far-end crosstalk from adjacent lanes appears as noise at the PAM4 slicer; CTLE and DFE are not designed to cancel crosstalk (unlike MIMO equalizers used in some other standards); the crosstalk must be kept within the ICN/ICF specification limits by adequate lane spacing in the PCB layout

Routing Rules: What Changes from Gen5

Maximum Trace Length

The maximum practical PCIe Gen6 trace length (without a retimer) on a well-designed board is shorter than Gen5:

  • Gen5 (NRZ): Up to 200–250 mm on Megtron 6E with VLP copper and optimized vias
  • Gen6 (PAM4): Up to 150–200 mm on Megtron 7 with HVLP copper and via-in-pad or heavily backdrilled vias; channels approaching 200 mm require careful pre-layout simulation to verify adequacy

For AI server baseboards where the CPU-to-GPU distance may be 200–300 mm, Gen6 channels of this length almost always require a PCIe retimer. This is consistent with the design of B200 NVL72 and GB200 architectures where retimers are commonly used on PCIe Gen6 channels of moderate length.

Intra-Pair Skew

Intra-pair skew requirements for Gen6 are tighter than Gen5:

  • Gen5: Intra-pair skew < 1.5 ps per segment; < 5 ps full channel
  • Gen6: Intra-pair skew < 1.0 ps per segment; < 3 ps full channel

The tighter requirement reflects the PAM4 system's greater sensitivity to any signal that converts differential mode to common mode. A 1 ps skew between P and N at 32 GBaud creates approximately −30 dB of differential-to-common mode conversion, which is at the limit of the Gen6 mode conversion budget.

Achieving < 1 ps intra-pair skew requires:

  • Trace length matching within approximately 0.17 mm (corresponding to ~1 ps propagation in Megtron 7 at ~17 cm/ns)
  • Symmetric via pad geometry for both P and N conductors (identical pad sizes and anti-pad dimensions to ensure equal via capacitance)
  • Avoiding asymmetric BGA pad entry geometry where one conductor exits at a different angle than the other

Lane Spacing and Crosstalk

Gen6 PAM4 crosstalk requirements are functionally more stringent than Gen5 NRZ because the PAM4 eye has less absolute voltage margin to absorb crosstalk noise:

  • Minimum edge-to-edge spacing between differential pairs: 4W rule (4 × trace width) for Gen6, compared to 3W for Gen5; at 100 μm trace width, minimum 400 μm edge-to-edge between adjacent Gen6 differential pairs
  • TX/RX isolation: PCIe Gen6 TX and RX traces must be routed on separate layers separated by a ground plane; routing TX and RX on the same layer with only trace spacing separation is not adequate for Gen6
  • Aggressor signals from NVLink: On AI server boards where PCIe Gen6 and NVLink 5.0 traces share the stackup (B200 boards), maintain ≥ 6W separation between NVLink and Gen6 routing groups, or route them on separate layers with a ground plane between them

Reference Plane Continuity

Reference plane disruptions are more damaging to Gen6 PAM4 than Gen5 NRZ because any impedance discontinuity or mode conversion caused by plane disruption reduces the already-tight PAM4 eye margin. Gen6 reference plane rules are stricter:

  • No power plane splits, via anti-pad arrays, or connector cutouts within 2 mm of any active Gen6 trace (vs 1 mm for Gen5)
  • Reference plane changes at layer transitions require ground stitching vias within 0.5 mm of the signal via (vs 1 mm for Gen5)
  • Ground planes adjacent to Gen6 signal layers must be dedicated ground (not split power planes); power planes sharing a layer with ground areas introduce impedance variation where the reference transitions from ground to power potential

Retimers in PCIe Gen6 Systems

A PCIe retimer is a signal conditioning chip that receives, re-equalizes, retimes, and retransmits the PCIe signal partway through a long channel. It effectively splits a single long channel into two shorter channels, each within the loss budget independently.

For PCIe Gen6, retimers are more frequently required than for Gen5 because the effective maximum channel length is shorter due to PAM4 margin constraints. Key design considerations when using Gen6 retimers:

  • Retimer placement: Placed at the midpoint of the channel (approximately equal length on each side) to minimize the maximum insertion loss in either channel segment; for a 300 mm total channel, the retimer should be placed at approximately 150 mm from each endpoint
  • Retimer power and PDN: Gen6 retimers consume 1–3 W each; with multiple PCIe ×16 channels on a baseboard, total retimer power can be significant and must be included in the board PDN design
  • Retimer PCB routing: The PCB traces from the retimer to the downstream device must independently meet the Gen6 channel loss budget; retimer output pads, package launch vias, and traces to the downstream component are all subject to the same rules as the primary channel
  • Latency: Each retimer adds approximately 2–5 ns of additional latency; for AI server applications, two retimers in a long channel add 4–10 ns per PCIe direction, which is generally acceptable

PCIe Gen5 vs Gen6: Full PCB Design Comparison

Design Parameter PCIe Gen5 (NRZ) PCIe Gen6 (PAM4)
Line rate 32 GT/s 64 GT/s
Signaling NRZ PAM4
Nyquist frequency 16 GHz 16 GHz (same)
Effective voltage margin per eye Full NRZ swing ⅓ NRZ swing per PAM4 eye
Insertion loss spec limit < 28 dB at 16 GHz < 28 dB at 16 GHz (same spec, tighter practical target)
Practical design target IL < 26 dB < 20–24 dB (need PAM4 eye margin)
Differential impedance 85 Ω ± 10% 85 Ω ± 10% (target ± 5% in practice)
Minimum laminate Df ~0.003 (Megtron 6E) ~0.002 (Megtron 7)
Copper foil minimum LP (VLP recommended) VLP minimum; HVLP best practice
Via stub target length < 10 mils < 5 mils; < 10 mils absolute max
Backdrill depth accuracy ± 50 μm ± 25–50 μm (tighter preferred)
Intra-pair skew < 1.5 ps per segment < 1.0 ps per segment
Lane spacing (edge-to-edge) 3W rule minimum 4W rule minimum
TX/RX layer separation Strongly recommended (same layer possible) Required (separate layers with ground plane between)
Reference plane gaps allowed > 1 mm from traces > 2 mm from traces
Max channel length (no retimer) 200–250 mm 150–200 mm
Retimer need Optional for long channels Commonly required > 150–200 mm
FEC type CRC-based; pre-FEC BER < 10−4 FBER-based; pre-FEC BER < 10−3 acceptable
Simulation requirement Recommended for long channels Required for all channels > 100 mm
Manufacturing complexity High Very high

PCIe Gen6 in AI Server Hardware: B200 and Beyond

NVIDIA's B200 GPU (Blackwell, SXM6 form factor) uses PCIe Gen6 ×16 as its primary host CPU interface, providing approximately 256 GB/s of bidirectional CPU-to-GPU bandwidth. In the GB200 Superchip, the Grace CPU connects to the B200 GPU via NVLink-C2C (900 GB/s coherent interconnect) rather than PCIe, but external PCIe Gen6 connections to other system components (NIC, storage, management) are still present.

On B200 baseboards, PCIe Gen6 signal routing coexists with NVLink 5.0 in the same stackup. Since NVLink 5.0 already demands Megtron 7 laminate and HVLP copper on its routing layers, PCIe Gen6 benefits automatically from the board's premium material selection. The additional constraint is routing isolation: NVLink 5.0 differential pairs running at 200 Gb/s per lane generate electromagnetic fields that must be shielded from adjacent PCIe Gen6 traces by adequate spacing (≥ 6W between routing groups) or separate layer assignment. For a complete treatment of B200 board design constraints, see NVIDIA Blackwell Architecture Explained: B200, GB200 & PCB Design Impact.

Looking beyond B200, PCIe Gen6 is expected to become the standard AI server accelerator host interface through 2026–2027, with PCIe Gen7 (128 GT/s per lane, likely PAM4 again at higher symbol rate) appearing on roadmaps for 2028+. Layout engineers investing in Gen6 design mastery—especially the transition from NRZ to PAM4 thinking and the associated PCB design discipline—are building skills that will apply through at least the next two PCIe generations.


Simulation Requirements for PCIe Gen6

Pre-layout simulation is not optional for Gen6 channels longer than 100 mm. The reduced PAM4 eye margin makes it essential to verify that the channel meets requirements before committing to a board design. The recommended simulation workflow:

  1. 2D field solver (stackup simulation): Verify differential impedance (85 Ω ± 5% target) and per-unit-length insertion loss at 16 GHz for the proposed trace geometry and Megtron 7 (or chosen) laminate; confirm that the trace loss contribution is within budget for the planned channel length
  2. 3D via simulation: Full 3D electromagnetic model of each via transition structure (pad, anti-pad, stub, ground stitching vias, package launch); extract S-parameters from 100 MHz to at least 30 GHz; verify that via insertion loss and return loss contributions are within the Gen6 budget
  3. Full channel simulation: Cascade all S-parameter models (trace, each via, connector, package launch) into the full channel; verify insertion loss, return loss, and differential-to-common mode conversion against Gen6 specification limits; verify that practical insertion loss is < 22–24 dB (not just < 28 dB) to maintain PAM4 eye margin
  4. PAM4 eye simulation: Time-domain simulation using IBIS-AMI models for transmitter and receiver; verify that the equalized PAM4 eye diagram meets the minimum eye height and width requirements at the receiver input; this step is the ultimate validation that the channel is trainable at Gen6 speed
  5. Statistical margin analysis: Monte Carlo simulation over manufacturing tolerances (trace width ± 8 μm, dielectric thickness ± 3%, Dk ± 0.05, stub length ± 50 μm) to verify that > 99.7% of production channels (3σ) meet the Gen6 eye specification

FAQ

Why does PCIe Gen6 use PAM4 instead of simply doubling the symbol rate?
Doubling the Gen5 symbol rate to 64 GBaud would require 32 GHz Nyquist performance from the PCB channel. At 32 GHz, dielectric and conductor losses in any commercially available laminate are so high that even a 50 mm channel would consume the full insertion loss budget. PAM4 at 32 GBaud (16 GHz Nyquist—same as Gen5) achieves double the data rate by using four voltage levels rather than two, keeping the channel frequency requirements within range of achievable PCB designs—albeit with much reduced noise margin compared to NRZ.

Can a PCIe Gen5-rated PCB be used for Gen6?
It depends on the channel length and the specific materials used. A PCIe Gen5 board designed with Megtron 7 and HVLP copper, < 5 mil via stubs, and < 150 mm channels may meet the PCIe Gen6 practical channel requirements, particularly if a retimer is used for longer segments. A PCIe Gen5 board designed with Megtron 6 and LP copper and relying on the full 28 dB budget will not support Gen6 without redesign. The key question is whether the channel achieves < 20–24 dB insertion loss at 16 GHz with adequate PAM4 eye margin—not whether it meets the Gen5 specification.

Is backdrilling still required for PCIe Gen6 if I use HDI microvias?
No. Laser-drilled blind/buried microvias have no stubs by definition and do not require backdrilling. One of the key benefits of HDI microvia technology for Gen6 is that it eliminates the stub resonance problem entirely on the via transitions that use microvias. Through-hole vias on Gen6 signal layers still require aggressive backdrilling (< 5 mil stub target) if microvias cannot be used at that location.

What is FLIT mode and why is it mandatory in Gen6?
FLIT (Flow Control Unit) mode restructures PCIe transactions into fixed 256-byte packets (FLITs) that carry their own CRC and are managed as atomic units. FLIT mode is mandatory in Gen6 because it enables the Gen6 FEC mechanism to operate on well-defined packet boundaries, and because the 242b/256b encoding used in FLIT mode is more efficient than the per-byte overhead of earlier PCIe encoding schemes. Gen5 introduced optional FLIT mode; Gen6 makes it non-optional.

How many retimers does a B200 baseboard typically need for PCIe Gen6?
It depends on the board layout. In a DGX B200-class system where the host CPU and GPU packages are on the same baseboard with PCIe Gen6 channels of 150–250 mm, one retimer per ×16 link is typically required. A system with 8 B200 GPUs and 2 CPUs could require 8 or more PCIe Gen6 retimers, each consuming 1–3 W of power and requiring PCB area for placement and routing. Retimer count is a meaningful cost and complexity driver in B200 baseboard design.

What is the pre-FEC BER target for a PCIe Gen6 channel?
The PCIe Gen6 FEC mechanism is designed to correct burst errors when the pre-FEC BER is < 10−3. In practice, a well-designed channel should achieve significantly better than this: pre-FEC BER of 10−6 to 10−5 is a reasonable design target that leaves adequate headroom above the FEC correction limit for process variation and aging. A channel operating routinely near the 10−3 FEC correction limit is marginal and may degrade over the product lifetime as connector and board aging increase channel loss.


Need to Manufacture PCIe Gen6 or AI Server PCBs?

PCIe Gen6 PAM4 boards demand the tightest combination of laminate quality, copper foil selection, via stub control, and impedance precision available in commercial PCB manufacturing. NextPCB supports Megtron 7 and ultra-low-loss laminate processing, HVLP copper, ± 5% impedance control, precision backdrilling to < 5 mil stubs, HDI microvia technology, and complete PCBA services for AI server and data center PCB programs.

Get a quote from NextPCB →


Related Articles:

Author Name

About the Author

Lolly Zheng- Sales Account Manager at NextPCB.com

Four years of proven sales experience across electronic components and PCBA industries, with strong expertise in key account acquisition, customer relationship management, and contract negotiations. Focused on driving revenue growth through strategic client development and solution-based selling. Experienced in expanding high-value accounts, securing long-term partnerships, and consistently exceeding sales targets in competitive markets.