+49 6430 9227117
NVIDIA DGX · TPM FOR AI COMPUTE

NVIDIA DGX Maintenance — vendor-independent service for DGX A100 to GB200

We service current NVIDIA DGX AI platforms vendor-independent — DGX A100 (320GB/640GB), DGX H100, DGX H200, DGX GH200 (Grace Hopper), DGX GB200 NVL72 (Grace Blackwell). With OEM components from our own warehouse and certified refurbishing sources, SLA up to 24×7×4. NVIDIA factory warranty and standard support are the most expensive OEM maintenance models in the data center — typically 15-25 percent of hardware value per year. TPM reduces this to 30-60 percent.

Which DGX models we service

DGX platforms are NVIDIA's reference-design AI servers for training and inference — complete system boxes with 8 GPUs (DGX A100/H100/H200), Grace Hopper superchips (GH200), or Grace Blackwell with 72 GPUs in a rack configuration (GB200 NVL72). Hardware is proprietary (NVLink topology, NVSwitch backplane, NVIDIA-specific power delivery), maintenance requires specialized engineer expertise.

DGX A100 · Ampere generation
DGX A100 320GB (8x A100 40GB) · DGX A100 640GB (8x A100 80GB)
DGX H100 / H200 · Hopper generation
DGX H100 (8x H100 SXM5 80GB) · DGX H200 (8x H200 141GB)
DGX GH200 · Grace Hopper
DGX GH200 (Grace+H100, 256x GH200 superchips, 144TB memory)
DGX GB200 NVL72 · Grace Blackwell
DGX GB200 NVL72 (72x B200 GPU + 36x Grace, liquid-cooled rack-scale)
Components
GPU boards · NVLink/NVSwitch · power supplies · cooling loop (GB200) · networking modules

Why TPM maintenance for NVIDIA DGX

DGX maintenance is the absolute highest-value lever in the TPM market. NVIDIA factory warranty for DGX typically runs 1-3 years, then standard or premium support becomes due — proportional to hardware value: typically 15-25 percent per year. For a DGX H100 (~250,000 EUR hardware value), that's 38-60,000 EUR maintenance per year. For a DGX GB200 NVL72 (~3M EUR hardware value), correspondingly 450-750,000 EUR per year. TPM reduces this to 30-50 percent — for an 8-box DGX cluster, savings quickly add up to six- to seven-figure amounts per year.

We service DGX platforms with OEM components from our own warehouse and certified refurbishing sources — DGX hardware is proprietary (NVLink topology, NVSwitch backplane, NVIDIA-specific power delivery, plus liquid cooling loop from GB200), component availability is structurally weaker than for standard servers. We continuously build the component pool for current DGX generations, with focus on GPU board replacement, NVSwitch modules and power supplies. Our engineers are specifically trained for DGX architecture — DGX maintenance is not comparable to standard x86 server service. CUDA, AI Enterprise, Base Command stay license-free active under TPM (software subscriptions run independently of hardware maintenance with NVIDIA).

30–60 %
Savings vs. NVIDIA standard/premium support
5-7 figures
Absolute annual savings on DGX clusters
DGX specialist
Engineer training for NVLink/NVSwitch architecture
CUDA stays
AI Enterprise, Base Command independent of TPM

Generations timeline & TPM coverage

Per hardware generation: vendor phase (slate) and TechCare coverage window (teal) up to ~5 years post-OEM EOSL.

Generation status of DGX line

DGX generations are all current — no EOSL foreseeable before 2030+. Factory warranty typically 1-3 years, then NVIDIA support or TPM. Oldest covered here (DGX A100, 2020) is out of factory warranty since 2023+.

Model family Released OEM support ends TPM status
DGX A100 320GB 2020 ca. 2028+ Supported
DGX A100 640GB 2020 ca. 2028+ Supported
DGX H100 2022 ca. 2030+ Supported
DGX H200 2024 ca. 2031+ Supported
DGX GH200 2024 ca. 2031+ Supported
DGX GB200 NVL72 2024–2025 ca. 2032+ Supported

As of 2026. EOSL data based on official vendor roadmaps and subject to change. Binding case-by-case information available on request.

What we deliver

OEM components

Our warehouse and certified refurbishing sources for DGX and Mellanox.

DGX specialist engineer

German-speaking technicians with NVLink/NVSwitch training, 4-hour response time guaranteed.

Flexible SLA per system

Parts Only, 5×9 NBD or 24×7×4 — freely combinable by location and criticality.

Multi-vendor contract

One contract for DGX, Mellanox and all other vendors. AI cluster stack consolidation.

Risk assessment

Component pool status per model before contract conclusion — honest disclosure.

CUDA & AI software stay

CUDA, AI Enterprise, Base Command Manager independent of hardware maintenance.

FAQ on DGX maintenance

Which DGX models do you service?
Current DGX family: DGX A100 (Ampere, 320GB and 640GB variants), DGX H100 (Hopper, 8x H100 SXM5 80GB), DGX H200 (Hopper refresh, 8x H200 141GB), DGX GH200 (Grace Hopper, 256x GH200 superchips with 144TB memory) and DGX GB200 NVL72 (Grace Blackwell, 72x B200 GPU plus 36x Grace, liquid-cooled rack-scale). Including all GPU boards, NVLink/NVSwitch modules, power supplies, cooling loop (GB200) and networking modules. For older DGX-1/-2 we have a separate spoke (DGX Legacy).
What does TPM cost for DGX compared to NVIDIA support?
30 to 60 percent savings. Specifically: DGX A100 640GB with 24×7×4 costs 30,000-45,000 EUR/year with NVIDIA premium support, 12,000-19,000 EUR with TechCare. DGX H100 correspondingly 38,000-60,000 EUR NVIDIA, 15,000-25,000 EUR TechCare. DGX H200 similar, slightly higher. DGX GH200 75,000-120,000 EUR NVIDIA, 30,000-50,000 EUR TechCare. DGX GB200 NVL72 (with 3M EUR hardware value) 450,000-750,000 EUR NVIDIA premium, with TechCare 180,000-320,000 EUR. For an 8-box DGX H100 cluster: 300-480k EUR NVIDIA, 120-200k EUR TechCare. Difference: 180-280k EUR per year.
Do CUDA, AI Enterprise and Base Command Manager continue to work without an NVIDIA contract?
Yes. CUDA toolkit, GPU drivers and all DGX-OS functions continue to work license-free on the hardware — all AI workloads (PyTorch, TensorFlow, Triton Inference, NeMo) stay functional. NVIDIA AI Enterprise as software subscription runs separately from hardware maintenance — customers actively using the subscription keep it with NVIDIA. Base Command Manager (BCM, cluster orchestration) is subscription-based and independent of maintenance contract. CUDA updates and newer GPU driver versions are freely available from NVIDIA — TPM customers can download them without restriction. Firmware updates on NVSwitch/BMC require an active NVIDIA contract but are usually uncritical with stable AI workloads.
How is component availability for DGX hardware?
Structurally weaker than for standard servers, but addressed with targeted component pool buildup. DGX hardware is proprietary — NVLink topology, NVSwitch backplane, NVIDIA-specific power delivery boards. We stock replacement GPU boards, NVSwitch modules, power supplies and networking modules for DGX A100/H100/H200. For very new generations (GH200, GB200), pool depth is limited — there we recommend hybrid setups: TPM for standard components (power supplies, fans, storage) plus selective NVIDIA subscription for GPU board coverage. At contract conclusion we create a risk assessment per DGX model — for critical workloads we recommend on-site spare components (negotiable in contract).
Which SLA levels do you recommend for DGX?
DGX in productive AI training workloads: 24×7×4 with German-speaking onsite engineer is standard. AI training jobs are often multi-day — outage of a DGX box mid-training means loss of training progress and possibly data loss. For AI inference production workloads (customer-facing LLMs, computer vision pipelines), 24×7×4 is mandatory due to direct service impact. For DGX in test/dev environments or as backup compute, 5×9 NBD can be economical. DGX GB200 NVL72 as liquid-cooled rack-scale system: 24×7×4 mandatory due to cooling loop complexity — cooling failure has high impact.
When is the natural entry point for TPM with DGX?
Factory warranty expiration. NVIDIA factory warranty for DGX typically runs 1-3 years depending on contract. Specifically: DGX A100 fleets from 2020-2022 have been out of factory warranty since 2023+ and are the natural TPM entry point. DGX H100 from 2022-2023 will run out of factory warranty from 2025-2026. DGX H200/GH200/GB200 are current — TPM is to be planned 1-3 years ahead here. We recommend: inventory check 6 months before warranty expiration, TPM contract from day 1 after warranty end — no gap, no NVIDIA standard support extension at full price as bridge.
Can we have DGX maintenance with Mellanox networking in the same contract?
Yes. Multi-class NVIDIA contracts are especially relevant for AI cluster builds — DGX AI compute needs Mellanox networking backend (Spectrum Ethernet or Quantum InfiniBand) plus ConnectX adapters and possibly BlueField DPUs. We offer all four NVIDIA classes (DGX, Mellanox adapters, Mellanox switching, plus DGX Legacy) in one contract, one point of contact, one SLA report set. Plus all other vendors (Supermicro AI servers, Dell PowerEdge GPU nodes, HPE Apollo) in the same construct.
How fast do we get a quote?
Within 48 hours after receipt of your inventory list with model, GPU configuration, factory warranty status and serial number.
Service performance

Real actuals Q1 2026 — straight from our ITIL ticketing.

99,2 %
Tickets resolved within agreed response time
2,4 h
Avg. first response on 4h SLA tier
88 %
First-time fix on initial dispatch
97 %
Spare part on site within 4 h, DACH depots
More from NVIDIA

Other NVIDIA models and service