HPC Cooling Technologies: Air Cooling, Direct Liquid Cooling, and Immersion Cooling
HPC cooling technology comparison: why cooling is critical for HPC (thermal throttling), component temperature limits, air cooling advantages and limitations, direct liquid cooling circuit design, rear-door heat exchangers, immersion cooling (single-phase vs two-phase), decision criteria, and monitoring commands.
Cooling is not a peripheral concern in HPC infrastructure — it is a primary constraint. Modern GPU nodes (DGX H100, HGX H100) dissipate 10–14 kW per 4U chassis. A 42U rack of GPU servers can generate 80–120 kW of heat. Air cooling at this density is physically impractical. Even for CPU-only clusters, thermal management directly affects performance: processors that exceed their temperature limits automatically reduce clock speed (thermal throttling), degrading simulation throughput with no warning.
Why Thermal Management Is Critical for HPC
Thermal throttling: Modern processors (Intel Xeon, AMD EPYC) and GPUs implement hardware thermal protection by reducing operating frequency when die temperature approaches the maximum specified value (TjMax for CPUs, typically 95°C; NVML thermal shutdown for GPUs at 90°C). A throttling processor delivers less compute per watt than a properly cooled one. An HPC node that throttles for 10% of its operating time delivers only 90% of its purchased compute capacity.
Component temperature limits:
| Component | Normal Operating | Warning | Critical (throttle/shutdown) |
|---|---|---|---|
| Intel/AMD CPU die | 55–75°C | 80°C | 90–95°C |
| NVIDIA H100 GPU | 50–70°C | 80°C | 88°C (throttle), 90°C (shutdown) |
| NVMe SSD | 40–55°C | 65°C | 70°C |
| InfiniBand HCA | 40–60°C | 70°C | 80°C |
| Memory (DDR5) | 40–55°C | 85°C | 95°C |
| Power supply | 40–55°C | 70°C | 80°C |
Reliability impact: The Arrhenius equation shows that every 10°C reduction in operating temperature approximately doubles component lifetime (the rule of thumb for semiconductor failure rates). Operating GPU clusters at 70°C inlet temperature (allowable in A2 ASHRAE class) vs. 25°C shortens DRAM and capacitor lifetime significantly.
Air Cooling
Air cooling moves heat from servers to room air, which is then cooled by Computer Room Air Conditioning (CRAC/CRAH) units.
How it works: Servers draw cool air from the front (cold aisle) and exhaust hot air from the rear (hot aisle). CRAC units supply conditioned air to the cold aisle and return hot exhaust. Hot/cold aisle containment (physical barriers separating the two air streams) is essential — without containment, hot and cold air mix, forcing CRAC units to over-cool to compensate.
Power density limit: Air cooling becomes impractical above approximately 15–20 kW per rack for standard raised-floor deployments, or 25–30 kW with high-velocity perforated tiles and precision containment. A 4U DGX H100 at 14 kW is already challenging for air cooling in a shared row.
Advantages:
- Lowest infrastructure cost for moderate density
- Standard IT operations — no coolant plumbing
- Immediate scalability (just add more CRAC capacity)
Limitations:
- PUE rarely below 1.4 (40% overhead for cooling infrastructure)
- Cannot handle > 20–30 kW/rack cost-effectively
- High airflow creates significant noise (75+ dBA) and vibration
Direct Liquid Cooling (DLC)
Direct liquid cooling delivers chilled water directly to the server chassis, where cold plates in contact with CPUs and GPUs transfer heat to the coolant. The heated coolant is returned to a building chiller or cooling tower.
DLC cooling circuit:
Building chilled water supply (10–18°C)
↓
Manifold / CDU (Cooling Distribution Unit)
↓
Flexible hoses to rack rear
↓
In-server manifold
↓
Cold plates on CPU / GPU / memory
↑
Return manifold → CDU → building return loop
The CDU is an intermediate heat exchanger that separates the building water from the server water, handling pressure differential and preventing building water contamination from entering servers.
Thermal performance: A properly designed DLC system maintains CPU and GPU junction temperatures 15–25°C lower than air cooling under the same load, completely eliminating thermal throttling on high-density HPC workloads.
Rear-door heat exchanger (RDHx): A simpler variant of DLC where a passive or active heat exchanger in the rack rear door absorbs server exhaust heat. No cold plates on CPUs/GPUs are required; existing servers work without modification. Effective to approximately 30–50 kW per rack.
Advantages of DLC:
- Handles 50–150 kW per rack
- PUE approaching 1.1–1.2 (versus 1.4–1.6 for air)
- Eliminates server fan noise (some DLC servers run fanless)
- Higher server inlet temperature is allowed (reduces cooling energy)
Installation requirements:
- Chilled water distribution infrastructure in the data center
- CDU units (one per rack row, or one per rack)
- Qualified plumbing for installation and maintenance
- Leak detection system around server manifolds
Immersion Cooling
Immersion cooling submerges servers directly in electrically non-conductive dielectric fluid. The fluid absorbs heat from all components simultaneously — not just CPUs and GPUs but also VRMs, capacitors, and PCIe cards that are unreachable by cold plates.
Single-phase immersion: Servers submerge in a fluid (engineered fluids from 3M Novec or similar) that remains liquid throughout the process. A pump circulates fluid from the bath through an external heat exchanger.
Two-phase immersion: Servers submerge in a fluid with a lower boiling point (~49°C for 3M Novec 7100). The fluid boils at the hot components, vapor rises, condenses on a water-cooled condenser at the top of the tank, and falls back as liquid. No pump needed for fluid circulation.
| Feature | Single-Phase | Two-Phase |
|---|---|---|
| Fluid cost | Lower | Higher |
| PUE | ~1.05 | ~1.02–1.03 |
| Component removal | Easy | Requires cool-down |
| Maintenance | Standard practices | Specialized skills |
| Fluid recovery | Simple | Requires vapor capture |
| Power density | Up to 100 kW/tank | Up to 250 kW/tank |
| Server compatibility | Standard servers (modified) | Standard servers (dip tanks) |
Immersion cooling advantages:
- Lowest PUE of any cooling approach (1.02–1.05)
- Servers can run overclocked beyond TDP specifications
- No air circulation required — significant noise reduction
- Components can last 2–3× longer at lower temperatures
- Highest possible rack density
Immersion cooling challenges:
- Significant infrastructure investment (tanks, fluid, secondary cooling)
- Fluid is expensive and must be managed carefully
- Not all hardware is certified for immersion
- Fluid contamination from component failures requires costly fluid changes
- Not suitable for environments where frequent hardware changes are expected
Decision Criteria
| Scenario | Recommended Cooling |
|---|---|
| < 10 kW/rack, existing data center | Air cooling with hot/cold aisle containment |
| 10–30 kW/rack, new deployment | Rear-door heat exchanger or DLC |
| 30–100 kW/rack (dense GPU cluster) | Direct liquid cooling |
| > 100 kW/rack, maximum efficiency | Single-phase immersion |
| Maximum performance, greenfield | Two-phase immersion |
Monitoring Thermal Metrics
# CPU temperatures via lm-sensors
sensors | grep -E "Core|Package|temp"
# GPU temperatures via nvidia-smi (all GPUs)
nvidia-smi --query-gpu=index,name,temperature.gpu --format=csv
# Continuous GPU monitoring with DCGM
dcgmi dmon -e 203,252,225 -d 5
# 203 = GPU temperature, 252 = power draw, 225 = SM utilization
# IPMI outlet and ambient temperature
ipmitool sdr type Temperature
# Alert rules in Prometheus (GPU temperature)
# gpu_alert.yml
groups:
- name: gpu_thermal
rules:
- alert: GPUTemperatureHigh
expr: DCGM_FI_DEV_GPU_TEMP > 85
for: 5m
labels:
severity: warning
annotations:
summary: "GPU {{ $labels.gpu }} on {{ $labels.instance }} at {{ $value }}°C"
- alert: GPUTemperatureCritical
expr: DCGM_FI_DEV_GPU_TEMP > 88
for: 2m
labels:
severity: critical
annotations:
summary: "GPU {{ $labels.gpu }} approaching thermal shutdown"
Cooling infrastructure is a long-term commitment that constrains which compute hardware can be deployed. The transition from air to liquid cooling for high-density GPU clusters is not a future concern — it is a present requirement for any deployment above 20–30 kW per rack. Plan cooling before finalizing server specifications. Contact Mevasis for HPC data center cooling design and thermal management consulting.