AI开发平台ModelArts-训练作业的日志出现detect failed(昇腾预检失败):问题现象

时间:2023-11-01 16:25:38

问题现象

训练启动的日志出现如下相关错误:

time="2023-05-27T07:07:08Z" level=error msg="detect failed, error: dsmi-checker detect failed, error: fork/exec /home/ma-user/modelarts/bin/detect/ascend_check: no such file or directory" file="ascend_check.go:56" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Servicetime="2023-05-27T07:07:13Z" level=error msg="[detect] ascend-check error, exiting..." file="run_train.go:94" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
support.huaweicloud.com/trouble-modelarts/modelarts_trouble_0145.html