Search_HUAWEI CLOUD

Updating Service Configurations - ModelArts

The current version supports modelarts.vm.cpu.2u, modelarts.vm.gpu.pnt004 (must be requested), modelarts.vm.ai1.snt3 (must be requested), and custom (available only when the service is deployed in a dedicated resource pool).

Help > ModelArts > API Reference > Service Management
Subscribing to a Built-in Commercial Service in ModelArts Studio (MaaS) - ModelArts

Tokens Per Minute (TPM): The number of tokens (input + output) processed per minute. Requests Per Minute (RPM): The number of requests processed per minute. For example, a model service with an RPM of 300 can process up to 5 requests per second on average (300/60 = 5).
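The relationship between a per-minute limit and a per-second rate can be sketched as below. This is only an illustration of the arithmetic; the function names are hypothetical and are not part of any ModelArts SDK, and the actual service may enforce its limits differently (for example, with burst windows).

```python
def per_minute_to_per_second(limit_per_minute: float) -> float:
    """Convert a per-minute limit (RPM or TPM) to an average per-second rate."""
    return limit_per_minute / 60.0


def within_limits(requests: int, tokens: int, rpm: float, tpm: float) -> bool:
    """Check whether one minute of traffic stays within both limits.

    `tokens` counts input plus output tokens, matching the TPM definition.
    """
    return requests <= rpm and tokens <= tpm


# An RPM of 300 averages out to 5 requests per second.
print(per_minute_to_per_second(300))
```

A client that paces itself with the per-second figure stays under the RPM limit on average, but it still needs to track tokens separately to respect TPM.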

Help > ModelArts > ModelArts Studio (MaaS) User Guide > ModelArts Studio (MaaS) Real-Time Inference Services
ModelArts Service Selection - ModelArts

ModelArts Studio (MaaS): MaaS console; UI; CN-Hong Kong. ModelArts Standard: ModelArts console; UI; all Huawei Cloud regions. ModelArts Lite Server: ModelArts console. Create a Lite Server node through the UI or API.

Help > ModelArts > Service Overview
Using PyTorch to Create a Training Job (New-Version Training) - ModelArts

This section describes how to train a model by calling ModelArts APIs.

Help > ModelArts > API Reference > Use Cases
Deploying Services - ModelArts

A model is deployed as a web service on an edge node through Intelligent EdgeFabric (IEF). The web service provides a real-time test UI and monitoring capabilities, and the service keeps running. You need to create a node on IEF beforehand.

Help > ModelArts > API Reference > Service Management
Viewing Training Output Results - ModelArts

MindSpeed-LLM throughput (tokens/s/p): (global batch size x sequence length) / (total number of PUs x elapsed time per iteration) x 1000. The global batch size (GBS) and sequence length (SEQ_LEN) are set during training and printed in the logs.
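The throughput formula can be written as a small helper. This is a sketch under one assumption: the factor of 1000 suggests that the elapsed time per iteration is logged in milliseconds, which the snippet does not state explicitly. The function name and the sample values are hypothetical.

```python
def throughput_tokens_per_s_per_pu(
    gbs: int, seq_len: int, num_pus: int, elapsed_ms_per_iter: float
) -> float:
    """Tokens per second per PU, following the formula in the snippet.

    Assumes elapsed_ms_per_iter is in milliseconds (hence the x 1000).
    """
    if num_pus <= 0 or elapsed_ms_per_iter <= 0:
        raise ValueError("num_pus and elapsed_ms_per_iter must be positive")
    return gbs * seq_len / (num_pus * elapsed_ms_per_iter) * 1000


# Hypothetical values: GBS 64, SEQ_LEN 4096, 8 PUs, 8192 ms per iteration.
print(throughput_tokens_per_s_per_pu(64, 4096, 8, 8192.0))
```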

Help > ModelArts > Best Practices > LLM Training > Adapting Mainstream Open-Source Models to AscendFactory NPU Training Based on Lite Server
Creating a Production Training Job (New Version) - ModelArts

You can set CPU (vCPUs), Memory (GB), Ascend (PU), and Compute Nodes as required. Custom specifications must match or stay below the node specifications of the dedicated resource pool. For CPU specifications, you can only customize the number of vCPUs and memory.
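The rule that custom specifications must match or stay below the node specifications amounts to a field-by-field comparison. The sketch below illustrates that check; the `Spec` class and `fits_node` function are hypothetical names for illustration, not ModelArts API objects.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Spec:
    """Hypothetical resource specification: vCPUs, memory in GB, Ascend PUs."""
    vcpus: int
    memory_gb: int
    ascend_pus: int = 0


def fits_node(custom: Spec, node: Spec) -> bool:
    """Return True if every custom value is at or below the node's value."""
    return (
        custom.vcpus <= node.vcpus
        and custom.memory_gb <= node.memory_gb
        and custom.ascend_pus <= node.ascend_pus
    )


node = Spec(vcpus=8, memory_gb=64, ascend_pus=1)
print(fits_node(Spec(vcpus=8, memory_gb=32), node))   # within the node
print(fits_node(Spec(vcpus=16, memory_gb=32), node))  # too many vCPUs
```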

Help > ModelArts > ModelArts User Guide (Standard) > Using ModelArts Standard to Train Models
Creating a Notebook Instance (New Page) - ModelArts

Notebook instances with remote SSH enabled have VS Code plug-ins (such as Python and Jupyter) and the VS Code server package pre-installed, which occupy about 1 GB of persistent storage space. Key Pair: Set a key pair after remote SSH is enabled.

Help > ModelArts > ModelArts User Guide (Standard) > Using Notebook for AI Development and Debugging
Obtaining Service Monitoring - ModelArts

Action: modelarts:service:getMonitor. Access Level: Read. Resource Type (*: required): service *. Condition Key: g:ResourceTag/<tag-key>. Alias: -. Dependencies: -.
URI: GET /v1/{project_id}/services/{service_id}/monitor
Table 1 Path Parameters. Parameter: project_id. Mandatory: Yes
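A request for this URI could be assembled as sketched below. Only the path `/v1/{project_id}/services/{service_id}/monitor` and the GET method come from the snippet. The endpoint host, the `X-Auth-Token` header, and the function name are assumptions for illustration, so check the API reference for the actual endpoint and authentication scheme. The sketch builds the request without sending it.

```python
import urllib.request
from urllib.parse import quote


def build_monitor_request(
    endpoint: str, project_id: str, service_id: str, token: str
) -> urllib.request.Request:
    """Build (but do not send) a GET request for service monitoring data.

    `endpoint` is a placeholder host; the token header name is an assumption.
    """
    url = (
        f"https://{endpoint}/v1/{quote(project_id, safe='')}"
        f"/services/{quote(service_id, safe='')}/monitor"
    )
    return urllib.request.Request(
        url, method="GET", headers={"X-Auth-Token": token}
    )


req = build_monitor_request(
    "modelarts.example.com", "my-project-id", "my-service-id", "my-token"
)
print(req.get_method(), req.full_url)
```

Sending the request would be a matter of passing `req` to `urllib.request.urlopen` and decoding the JSON response body.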

Help > ModelArts > API Reference > Service Management
Starting a DevServer Supernode Server - ModelArts

Letters, digits, and hyphens (-) are allowed.

Help > ModelArts > API Reference > DevServer Management
Stopping a DevServer Supernode Server - ModelArts

Letters, digits, and hyphens (-) are allowed.

Help > ModelArts > API Reference > DevServer Management
Common Error Causes and Solutions - ModelArts

Adjust the values of micro-batch-size (MBS, minimum number of samples processed in a batch) and global-batch-size (GBS, number of samples processed in an iteration).

Help > ModelArts > Best Practices > LLM Training > Adapting Mainstream Open-Source Models to AscendFactory NPU Training Based on Lite Server
Changing the OS Image of the DevServer Server - ModelArts

The maximum size of the content to be injected (before encoding) is 32K. Range: N/A Default Value: N/A Response Parameters Status code: 200 Table 3 Response header parameters Parameter Type Description X-Request-Id String Link trace ID.

Help > ModelArts > API Reference > DevServer Management
Reinstalling the OS Image of the DevServer Server - ModelArts

The maximum size of the content to be injected (before encoding) is 32K. Range: N/A Default Value: N/A Response Parameters Status code: 200 Table 3 Response header parameters Parameter Type Description X-Request-Id String Link trace ID.

Help > ModelArts > API Reference > DevServer Management
What Is ModelArts? - ModelArts

ModelArts Studio (MaaS): ModelArts MaaS offers an end-to-end toolchain for foundation model production, along with compute resources and popular open-source models. It is designed for users who need to develop production-ready models using a MaaS platform.

Help > ModelArts > Service Overview
Obtaining Service Update Logs - ModelArts

Action: modelarts:service:getLogs. Access Level: Read. Resource Type (*: required): service *. Condition Key: g:ResourceTag/<tag-key>. Alias: -. Dependencies: -.
URI: GET /v1/{project_id}/services/{service_id}/logs
Table 1 Path Parameters. Parameter: project_id. Mandatory: Yes. Type: String

Help > ModelArts > API Reference > Service Management
Running a Single-Node Single-PU Training Job on ModelArts Standard - ModelArts

It is recommended that the Linux server have sufficient memory (more than 8 GB) and hard disk space (more than 100 GB).

Help > ModelArts > Best Practices > Model Training > Running a Training Job on ModelArts Standard
Starting a Notebook Instance - ModelArts

Lowercase letters, digits, hyphens (-), underscores (_), and periods (.) are allowed. namespace String Definition: Organization to which the image belongs. You can create and view an organization on the Organization Management page of the SWR console.

Help > ModelArts > API Reference > Development Environment Management
Error ModelArts.4503 Occurred in Real-Time Service Prediction - ModelArts

If the model service (server) initiates a disconnection, but the connection is being used by ModelArts (client), a communication error occurs and this error code is returned.

Help > ModelArts > Troubleshooting > Inference Deployment > Service Prediction
Creating a Standard Dedicated Resource Pool - ModelArts

Node Pool Flavor Node Type Common node: A physical or virtual server that provides independent basic compute, storage, and network resources.

Help > ModelArts > ModelArts User Guide (Standard) > ModelArts Standard Resource Management

Total results: 96
