HGX2 Field Diagnostics Tool

Andrew Rodriguez

Document Scope

This article covers how to use the HGX field diagnostics tool to troubleshoot HGX GPUs. These specialized GPUs are not easily removed from their chassis, so they cannot simply be moved into another system for A/B testing. Instead, we rely on this tool to determine the current state of the GPU.

Prerequisite

This tool needs to be on a bootable USB drive. If you already have the tool on a bootable USB drive, skip to Step 2. The files can be downloaded from this link:

https://exxact-support.s3-us-west-1.amazonaws.com/Test+Folder/hgx2-diagos_18.07.1-03_bundle.zip

Step 1: Create a bootable USB

I used Rufus 2.18, but basically any version of Rufus will work as long as you uncheck the "Create a bootable disk using..." option.

Example of Rufus settings

image2020-8-28_10-33-26.png

Step 2: Boot to the USB

In this example, the drive is a PNY USB, and I selected its UEFI boot option.

image2020-8-24_12-4-54.png

Step 3: Unpack Field Diagnostic Tool files

Use the command below to unpack the Field Diagnostics Tool files:

tar xfz 629-FKD03-2887-510.tgz

The system should boot to a command prompt, which is where you run the command:

image2020-8-24_12-7-37.png

image2020-8-24_12-11-0.png
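The unpack step can also be wrapped in a small check so that a missing or corrupt archive is caught before extraction. The following is only a sketch: the function name `unpack_diag` is hypothetical, and it assumes the boot environment provides a POSIX shell and a `tar` that supports gzip (`z`). The archive filename is the one used in this article.

```shell
#!/bin/sh
# Sketch only: unpack_diag is a hypothetical helper, not part of the tool.
# Assumes a POSIX shell and a tar with gzip support.

unpack_diag() {
    archive="$1"

    # Stop early if the archive is not in the current directory.
    if [ ! -f "$archive" ]; then
        echo "archive not found: $archive" >&2
        return 1
    fi

    # "t" lists the contents without extracting; this fails on a corrupt file.
    tar tzf "$archive" >/dev/null || return 1

    # Extract into the current directory, same as the command in Step 3.
    tar xzf "$archive"
}
```

Example usage, from the directory that holds the archive:

unpack_diag 629-FKD03-2887-510.tgz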

Step 4: Change directory into the unpacked files

Use the command below to change into the directory containing the unpacked files:

cd 629-FKD03-2887-510

image2020-8-24_12-12-27.png

Example of failed test:

image2020-8-24_12-18-6.png

Step 5: Provide logs generated by the test

Power off the system once the tests complete and remove the USB drive. Plug the drive into your personal computer and find the logs under:

USB (Example) DGX2 (D:) → home → 629-FLD03-2887-510 → Logs → zipped files

image2020-8-24_12-22-6.png

The zipped files for the tests that were run:

image2020-8-24_12-24-18.png
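If the personal computer has a POSIX shell available (Linux, macOS, or a shell such as Git Bash on Windows), the logs can be copied off the USB drive in one step before attaching them to a support request. This is a minimal sketch under assumptions: the function name `collect_logs` is hypothetical, and the source path depends on where the USB drive is mounted on your machine.

```shell
#!/bin/sh
# Sketch only: collect_logs is a hypothetical helper, not part of the tool.
# Usage: collect_logs SRC DEST
#   SRC  = the Logs folder on the USB drive
#   DEST = a local folder to copy the logs into (created if missing)

collect_logs() {
    src="$1"
    dest="$2"

    # Stop early if the Logs folder cannot be found.
    if [ ! -d "$src" ]; then
        echo "log directory not found: $src" >&2
        return 1
    fi

    mkdir -p "$dest" || return 1

    # Copy the contents of SRC (including subfolders) into DEST.
    cp -R "$src"/. "$dest"/ || return 1

    # Print every copied file so you can confirm nothing is missing.
    find "$dest" -type f | sort
}
```

Example usage, where /media/usb is a hypothetical mount point for the USB drive:

collect_logs /media/usb/home/629-FLD03-2887-510/Logs ./hgx2-logs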
