Question 1

Which DGX models do you service?

Accepted Answer

Current DGX family: DGX A100 (Ampere, 320GB and 640GB variants), DGX H100 (Hopper, 8x H100 SXM5 80GB), DGX H200 (Hopper refresh, 8x H200 141GB), DGX GH200 (Grace Hopper, 256x GH200 superchips with 144TB memory) and DGX GB200 NVL72 (Grace Blackwell, 72x B200 GPU plus 36x Grace, liquid-cooled rack-scale). Including all GPU boards, NVLink/NVSwitch modules, power supplies, cooling loop (GB200) and networking modules. For older DGX-1/-2 we have a separate spoke (DGX Legacy).

Question 2

What does TPM cost for DGX compared to NVIDIA support?

Accepted Answer

30 to 60 percent savings. Specifically: DGX A100 640GB with 24×7×4 costs 30,000-45,000 EUR/year with NVIDIA premium support, 12,000-19,000 EUR with TechCare. DGX H100 correspondingly 38,000-60,000 EUR NVIDIA, 15,000-25,000 EUR TechCare. DGX H200 similar, slightly higher. DGX GH200 75,000-120,000 EUR NVIDIA, 30,000-50,000 EUR TechCare. DGX GB200 NVL72 (with 3M EUR hardware value) 450,000-750,000 EUR NVIDIA premium, with TechCare 180,000-320,000 EUR. For an 8-box DGX H100 cluster: 300-480k EUR NVIDIA, 120-200k EUR TechCare. Difference: 180-280k EUR per year.

Question 3

Do CUDA, AI Enterprise and Base Command Manager continue to work without an NVIDIA contract?

Accepted Answer

Yes. CUDA toolkit, GPU drivers and all DGX-OS functions continue to work license-free on the hardware — all AI workloads (PyTorch, TensorFlow, Triton Inference, NeMo) stay functional. NVIDIA AI Enterprise as software subscription runs separately from hardware maintenance — customers actively using the subscription keep it with NVIDIA. Base Command Manager (BCM, cluster orchestration) is subscription-based and independent of maintenance contract. CUDA updates and newer GPU driver versions are freely available from NVIDIA — TPM customers can download them without restriction. Firmware updates on NVSwitch/BMC require an active NVIDIA contract but are usually uncritical with stable AI workloads.

Question 4

How is component availability for DGX hardware?

Accepted Answer

Structurally weaker than for standard servers, but addressed with targeted component pool buildup. DGX hardware is proprietary — NVLink topology, NVSwitch backplane, NVIDIA-specific power delivery boards. We stock replacement GPU boards, NVSwitch modules, power supplies and networking modules for DGX A100/H100/H200. For very new generations (GH200, GB200), pool depth is limited — there we recommend hybrid setups: TPM for standard components (power supplies, fans, storage) plus selective NVIDIA subscription for GPU board coverage. At contract conclusion we create a risk assessment per DGX model — for critical workloads we recommend on-site spare components (negotiable in contract).

Question 5

Which SLA levels do you recommend for DGX?

Accepted Answer

DGX in productive AI training workloads: 24×7×4 with German-speaking onsite engineer is standard. AI training jobs are often multi-day — outage of a DGX box mid-training means loss of training progress and possibly data loss. For AI inference production workloads (customer-facing LLMs, computer vision pipelines), 24×7×4 is mandatory due to direct service impact. For DGX in test/dev environments or as backup compute, 5×9 NBD can be economical. DGX GB200 NVL72 as liquid-cooled rack-scale system: 24×7×4 mandatory due to cooling loop complexity — cooling failure has high impact.

Question 6

When is the natural entry point for TPM with DGX?

Accepted Answer

Factory warranty expiration. NVIDIA factory warranty for DGX typically runs 1-3 years depending on contract. Specifically: DGX A100 fleets from 2020-2022 have been out of factory warranty since 2023+ and are the natural TPM entry point. DGX H100 from 2022-2023 will run out of factory warranty from 2025-2026. DGX H200/GH200/GB200 are current — TPM is to be planned 1-3 years ahead here. We recommend: inventory check 6 months before warranty expiration, TPM contract from day 1 after warranty end — no gap, no NVIDIA standard support extension at full price as bridge.

Question 7

Can we have DGX maintenance with Mellanox networking in the same contract?

Accepted Answer

Yes. Multi-class NVIDIA contracts are especially relevant for AI cluster builds — DGX AI compute needs Mellanox networking backend (Spectrum Ethernet or Quantum InfiniBand) plus ConnectX adapters and possibly BlueField DPUs. We offer all four NVIDIA classes (DGX, Mellanox adapters, Mellanox switching, plus DGX Legacy) in one contract, one point of contact, one SLA report set. Plus all other vendors (Supermicro AI servers, Dell PowerEdge GPU nodes, HPE Apollo) in the same construct.

Question 8

How fast do we get a quote?

Accepted Answer

Within 48 hours after receipt of your inventory list with model, GPU configuration, factory warranty status and serial number.

Model family	Released	OEM support ends	TPM status
DGX A100 320GB	2020	ca. 2028+	Supported
DGX A100 640GB	2020	ca. 2028+	Supported
DGX H100	2022	ca. 2030+	Supported
DGX H200	2024	ca. 2031+	Supported
DGX GH200	2024	ca. 2031+	Supported
DGX GB200 NVL72	2024–2025	ca. 2032+	Supported

NVIDIA DGX Maintenance — vendor-independent service for DGX A100 to GB200

Which DGX models we service

Why TPM maintenance for NVIDIA DGX

Generations timeline & TPM coverage

Generation status of DGX line

What we deliver

OEM components

DGX specialist engineer

Flexible SLA per system

Multi-vendor contract

Risk assessment

CUDA & AI software stay

TechCare vs. NVIDIA DGX A100 / H100 / H200 / GH200 / GB200

FAQ on DGX maintenance

Real actuals Q1 2026 — straight from our ITIL ticketing.

Save on NVIDIA DGX A100 / H100 / H200 / GH200 / GB200 without risk

Other NVIDIA models and service