检测到您已登录华为云国际站账号,为了您更好的体验,建议您访问国际站服务网站 https://www.huaweicloud.com/intl/zh-cn
不再显示此消息
The value 0 indicates the default workspace. ai_project String AI project to which a specified algorithm belongs. The default value is default-ai-project. The AI project has been brought offline.
KVM ki1.4xlarge.4 16 64 12/6 140 6 4 2 × 3,200 GiB KVM ki1.6xlarge.4 24 96 15/8.5 200 6 8 3 × 3,200 GiB KVM ki1.8xlarge.4 32 128 18/10 260 6 8 4 × 3,200 GiB KVM ki1.12xlarge.4 48 192 25/16 350 6 16 6 × 3,200 GiB KVM ki1.16xlarge.4 64 228 30/20 400 6 16 8 × 3,200 GiB KVM Kunpeng AI
Options: 0: OBS bucket (default value) 1: GaussDB(DWS) 2: DLI 3: RDS 4: MRS 5: AI Gallery 6: Inference service schema_maps Array of SchemaMap objects Schema mapping information corresponding to the table data. source_info SourceInfo object Information required for importing a table
Editing Training Code and Saving the Model Training code and the code for saving the model are closely related to the AI engine you use. The following uses the TensorFlow framework as an example.
The code is actually executed in the cloud development environment, and the Ascend AI resources on the cloud are used. In this way, you compile and modify code locally and run the code in the cloud. Run the code in the local IDE. The logs can be displayed locally.
The OBS object path refers to the OBS object URL. kind Yes String File type of a package group. jar: JAR file pyFile: User Python file file: User file modelFile: User AI model file NOTE: If the same group of packages to be uploaded contains different file types, select file as the
The K8S node contains one of the following taints: node.kubernetes.io/unreachable node.kubernetes.io/not-ready A050203 Runtime Disconnection The number of normal AI cards does not match the actual capacity. The GPU or NPU is disconnected.
For details about other npu-smi commands, see Atlas 800 AI Training Server npu-smi Command Reference (Model 9000). Important Notes Logs are recorded based on the system time. NPU synchronizes the system time. To change the system time, run the date command.
Attackers use AI and machine learning technologies to accelerate the iteration of attack tools and methods. For example, an advanced persistent threat (APT) is a covert and persistent network attack.
Complete Training Code Example The training code is closely related to the AI engine you use. The following uses the TensorFlow framework as an example. Before using this case, you need to download the mnist.npz file and upload it to the OBS bucket.
The code is actually executed in the cloud development environment, and the Ascend AI resources on the cloud are used. In this way, you compile and modify code locally and run the code in the cloud. Run the code in the local IDE. The logs can be displayed locally.
When you need a stable and available snapshot for tasks such as AI training, you need to publish the snapshot. Syntax Create a snapshot. You can use the CREATE SNAPSHOT... AS and CREATE SNAPSHOT... FROM statements to create a data table snapshot.
Notes and Constraints To support Kubernetes' default GPU scheduling on GPU nodes, the CCE AI Suite (NVIDIA GPU) add-on must be of v2.0.10 or later, and the Volcano Scheduler add-on must be of v1.10.5 or later. Example of Shared GPU Scheduling Use kubectl to access the cluster.
The CCE AI Suite (NVIDIA GPU) add-on has been installed in the cluster, and the add-on version is 2.0.10 or later. At least one NVIDIA GPU node is available in the cluster.
Therefore, the VPC network model applies to scenarios that have high requirements on performance, such as AI computing and big data computing.
Supported Supported Supported Supported UI Video call Video call Supported Supported Supported Supported Not supported UI Other Beautification Supported Not supported Supported Supported Not supported UI Virtual background Supported Supported Supported Supported Not supported UI AI
The Huawei Cloud QingTian architecture with hardware-software synergy and top-flight AI algorithms ensures that there is no freezing and the latency is extremely low when users are playing games and audios or videos.
See Table 8. ai_project Object AI project. For details, see Table 9. error_code String Error code. For details, see Error Codes. queuing_info Object Queuing information. For details, see Table 17. user Object User information.
Table 12 AIProject parameters Parameter Type Description id String AI project ID.
(Optional) GPU Quota Configurable only when the cluster contains GPU nodes and the CCE AI Suite (NVIDIA GPU) add-on has been installed. Do not use: No GPU will be used. GPU card: The GPU is dedicated for the container.