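AI Development Platform ModelArts - Supported Models: Models Supported for Training

Time: 2025-06-24 10:36:33

Models Supported for Training

This solution supports training the models listed in Table 1.

Table 1 Supported models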

| No. | Model family | Model size | Weights download URL |
| --- | --- | --- | --- |
| 1 | Qwen2 | qwen2-0.5b | https://huggingface.co/Qwen/Qwen2-0.5B-Instruct |
| 2 | Qwen2 | qwen2-1.5b | https://huggingface.co/Qwen/Qwen2-1.5B-Instruct |
| 3 | Qwen2 | qwen2-7b | https://huggingface.co/Qwen/Qwen2-7B-Instruct |
| 4 | Qwen2 | qwen2-72b | https://huggingface.co/Qwen/Qwen2-72B-Instruct |
| 5 | GLMv4 | glm4-9b | https://huggingface.co/THUDM/glm-4-9b-chat |
| 6 | mixtral | mixtral-8x7b | https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1 |
| 7 | llama3.1 | llama3.1-8b | https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct |
| 8 | llama3.1 | llama3.1-70b | https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct |
| 9 | Qwen2.5 | qwen2.5-0.5b | https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct |
| 10 | Qwen2.5 | qwen2.5-7b | https://huggingface.co/Qwen/Qwen2.5-7B-Instruct |
| 11 | Qwen2.5 | qwen2.5-14b | https://huggingface.co/Qwen/Qwen2.5-14B-Instruct |
| 12 | Qwen2.5 | qwen2.5-32b | https://huggingface.co/Qwen/Qwen2.5-32B-Instruct |
| 13 | Qwen2.5 | qwen2.5-72b | https://huggingface.co/Qwen/Qwen2.5-72B-Instruct |
| 14 | Qwen3 | qwen3-0.6b | https://huggingface.co/Qwen/Qwen3-0.6B |
| 15 | Qwen3 | qwen3-1.7b | https://huggingface.co/Qwen/Qwen3-1.7B |
| 16 | Qwen3 | qwen3-4b | https://huggingface.co/Qwen/Qwen3-4B |
| 17 | Qwen3 | qwen3-8b | https://huggingface.co/Qwen/Qwen3-8B |
| 18 | Qwen3 | qwen3-14b | https://huggingface.co/Qwen/Qwen3-14B |
| 19 | Qwen3 | qwen3-32b | https://huggingface.co/Qwen/Qwen3-32B |
| 20 | Qwen3_MOE | qwen3_moe-30B_A3B | https://huggingface.co/Qwen/Qwen3-30B-A3B |
| 21 | Qwen3_MOE | qwen3_moe-235B_A22B | https://huggingface.co/Qwen/Qwen3-235B-A22B |
| 22 | llama3.2 | llama3.2-1b | https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct |
| 23 | llama3.2 | llama3.2-3b | https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct |
| 24 | DeepSeek | DeepSeek-V3 | https://huggingface.co/deepseek-ai/DeepSeek-V3-Base/tree/main |
| 25 | DeepSeek | DeepSeek-R1 | https://huggingface.co/deepseek-ai/DeepSeek-R1/tree/main |
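
The weight URLs in Table 1 are Hugging Face repository pages, so one possible way to fetch a model's weights is to reduce the URL to its `<owner>/<name>` repository id and pass that id to a download tool. The following is a minimal sketch and not part of the ModelArts solution itself. The names `WEIGHT_URLS`, `repo_id_from_url`, and `download_weights` are hypothetical helpers introduced here for illustration, and only a few rows of the table are copied in. The download step assumes the third-party `huggingface_hub` package is installed and that the machine can reach huggingface.co.

```python
from urllib.parse import urlparse

# A few rows copied from Table 1 (model size -> weights URL).
# WEIGHT_URLS is an illustrative name, not something the solution defines.
WEIGHT_URLS = {
    "qwen2-0.5b": "https://huggingface.co/Qwen/Qwen2-0.5B-Instruct",
    "glm4-9b": "https://huggingface.co/THUDM/glm-4-9b-chat",
    "qwen3_moe-30B_A3B": "https://huggingface.co/Qwen/Qwen3-30B-A3B",
    "DeepSeek-V3": "https://huggingface.co/deepseek-ai/DeepSeek-V3-Base/tree/main",
}


def repo_id_from_url(url: str) -> str:
    """Return the '<owner>/<name>' repository id from a Hugging Face model URL."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) < 2:
        raise ValueError(f"not a model repository URL: {url}")
    # Anything after owner/name (for example 'tree/main') is a page suffix.
    return "/".join(parts[:2])


def download_weights(model_key: str, target_dir: str) -> str:
    """Download one model's files into target_dir and return the local path.

    Assumption: huggingface_hub is installed and network access is available.
    """
    # Imported lazily so the URL helper works without the package installed.
    from huggingface_hub import snapshot_download

    repo_id = repo_id_from_url(WEIGHT_URLS[model_key])
    return snapshot_download(repo_id=repo_id, local_dir=target_dir)


if __name__ == "__main__":
    # Only prints the derived repository ids; nothing is downloaded here.
    for key, url in WEIGHT_URLS.items():
        print(key, "->", repo_id_from_url(url))
```

Calling `download_weights("qwen2-0.5b", "/path/to/weights")` would then place the files in the given directory, where the path is a placeholder. Some repositories, such as the meta-llama ones, may require accepting a license and authenticating with a Hugging Face token before the download succeeds.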

support.huaweicloud.com/bestpractice-modelarts/modelarts_llm_train_5905033.html