Are there any specific tests or playbooks that can be run to verify that the Spark is not defective, crashing unexpectedly, overheating, etc., as some users appear to be reporting (i.e., DGX Spark: low fan speed, high temps, device very hot)? I am assuming the DGX Spark is meant to be capable of extended periods of training/fine-tuning?
Are there specific metrics in the DGX Dashboard, or other available metrics, that can confirm the device is behaving normally?