dcgmMonitor Plugin

Deploy a Kubernetes (k8s) environment on your GPU machine, then install dcgm-exporter and HoloInsight-Agent as described in their documentation:

dcgm-exporter

holoinsight-agent

After installation, GPU data is collected by default.
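Before installing the plug-in, it can help to confirm that dcgm-exporter is actually serving GPU metrics. The following is a minimal sketch rather than part of HoloInsight. It assumes the exporter is reachable at `http://localhost:9400/metrics` (9400 is dcgm-exporter's default port; in a k8s cluster you would typically need a port-forward first) and that samples are emitted in the Prometheus text format without trailing timestamps. The helper names `parse_dcgm_metrics` and `fetch_dcgm_metrics` are hypothetical.

```python
import urllib.request


def parse_dcgm_metrics(text, prefix="DCGM_"):
    """Parse Prometheus text-format lines into (name, labels, value) tuples.

    Assumes each sample line ends with its value (no trailing timestamp).
    """
    samples = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blanks, HELP/TYPE comments, and metrics outside the prefix.
        if not line or line.startswith("#") or not line.startswith(prefix):
            continue
        # The value is the last space-separated token on the line.
        series, _, value = line.rpartition(" ")
        # Everything before "{" is the metric name; the rest is the label set.
        name, _, labels = series.partition("{")
        samples.append((name, labels.rstrip("}"), float(value)))
    return samples


def fetch_dcgm_metrics(url="http://localhost:9400/metrics", timeout=5):
    """Fetch and parse metrics from a dcgm-exporter endpoint (assumed URL)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_dcgm_metrics(resp.read().decode("utf-8"))


if __name__ == "__main__":
    for name, labels, value in fetch_dcgm_metrics():
        print(name, labels, value)
```

If the script prints lines for metrics such as `DCGM_FI_DEV_GPU_UTIL`, the exporter is working and any missing data is more likely an agent or plug-in issue.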

Open page http://localhost:8080/integration/agentComp?tenant=default.

Install the DCGMMonitor plug-in on the Integration Components page

dcgm1.png dcgm2.png

DCGMMonitor dashboards can be generated automatically to monitor GPU information.

dcgm3.png

dcgm4.png