MetaStudio

More Professional Tools for Digital Content Production

Intelligent Interaction
Product Overview

MetaStudio AI assistants speak accurately, wake automatically, and respond quickly, just like a real person.

Key Features
Enhanced interactivity with natural interruptions
E2E latency under 3 seconds
Scenario adaptation with just 1 SDK and 5 lines of code
Open ecosystem compatible with mainstream LLMs
Video Production
Product Overview

This one-stop platform reduces costs and boosts efficiency and productivity of your organization.

Key Features
Output resolution up to 4K
Text and audio control, with rapid 1080p compositing in just 1.5x video length
Automatic parsing of PowerPoint files
Video customization with an enormous number of preset templates and any materials you import (images, video, and audio)
Sensitive content detection
Support for API calling and SaaS delivery
Image Modeling
Product Overview

All that it takes to train a virtual avatar that speaks like its human reference is one video clip.

Key Features
Virtual avatar delivery in 7 working days
Encrypted storage and identity authorization of assets
Lip sync accuracy > 95%
Training that enables virtual avatars to walk, turn around, and hold objects in real scenes with intelligent choreography
AI correction for continuous eye contact and more realistic details
Voice Modeling
Product Overview

Voice cloning in 18 languages (including Chinese, English, Spanish, and Arabic) to reproduce natural speech

Key Features
Lifelike timbre reproduced after 7-day training based on just 10 minutes of mobile audio

Products Right for Your Every Need

Products Right for Your Every Need

  • Intelligent interaction
  • Video production
  • Image modeling
  • Voice modeling

Why Huawei Cloud MetaStudio?

Why Huawei Cloud MetaStudio?

Unparalleled Realism

  • Good articulation, lip sync accuracy > 95%

  • Expressive voices, with 10+ natural language styles

  • Broadcast-caliber pronunciation accuracy: 99.9%

Enhanced Interactivity

  • Interruption anytime

  • THX AAA noise reduction and AI VAD

  • Multilingual response latency under 3 seconds

Effortless Integration

  • Simplified development and quick integration with just 1 SDK and 5 lines of code

  • Open ecosystem compatible with mainstream LLMs

Efficient Digital Content Production for Diverse Needs

Intelligent customer service
Intelligent customer service

Technical service providers leverage the intelligent interaction capability of the platform to integrate virtual avatars into multiple terminal types. This takes voice customer service to new levels.

Key features
AI agents with custom appearances and voices make customer service more relatable.
Intelligent response is available in both text- and speech-based engagements.
News broadcast
News broadcast

Error-proof virtual anchors in one-off training

News broadcast requires precision and professionalism of anchors. This is exactly where virtual anchors can come in. They do not need memorization or make errors, and maintain the professionalism of their human counterparts.

Key features
Output resolution up to 4K
Text and audio control, with rapid 1080p compositing in just 1.5x video length
Automatic parsing of PowerPoint files
Video customization with any materials you import (images, video, and audio)
Education and training
Education and training

Efficient online course video recording with virtual avatars

Virtual avatars are never camera-shy. Lecturers now spend less time on course recording and more time and effort on polishing content. Virtual lecturers convey consistent messages and embody the professionalism of education providers.

Key features
Output resolution up to 4K
Text and audio control, with rapid 1080p compositing in just 1.5x video length
Automatic parsing of PowerPoint files
Video customization with any materials you import (images, video, and audio)
Healthcare
Healthcare

The cure for hiccups in healthcare video production

Virtual doctors are useful for making medical knowledge accessible in video before, during, and after diagnosis.

They are a perfect addition to hospitals, clinics, medical institutions, and vendors of medical devices, medicines, and healthcare products.

Key features
Output resolution up to 4K
Text and audio control, with rapid 1080p compositing complete in just 1.5x video length
Automatic parsing of PowerPoint files
Video customization with an enormous number of preset templates and any materials you import (images, video, and audio)

Continuous Business Innovation with Millions of Customers

Continuous Business Innovation with Millions of Customers

Embark on Your Journey to

Intelligent Digital Content Production


Now
Try Now