Real-Time ASR

Real-Time ASR (RASR) converts continuous audio streams into text as the audio arrives. With RASR, you can subtitle live video in sync with the feed and record conference content as text. Instant text generation also supports a wide range of other applications.



Fees start as low as ¥1.20 per hour.

Product Advantages
  • High Recognition Accuracy

    Uses the latest generation of speech recognition and Deep Neural Network (DNN) technologies to improve noise robustness and recognition accuracy.

  • High Speed

    Integrates the language models, dictionaries, and acoustic models into a large neural network, with engineering optimizations that increase decoding speed and shorten recognition time.

  • Multiple Recognition Modes

    Supports multiple real-time speech recognition modes, including streaming, continuous, and single-sentence, to suit different application scenarios.

  • Customization Service

    Allows you to customize the language-layer model for a specific vertical domain so that proprietary words and industry terms are recognized more accurately.
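
As an illustration of the streaming mode, a client usually sends audio in small chunks as it is captured rather than uploading a whole file. The Python sketch below shows one way to split raw audio into fixed-duration chunks. It is a minimal example under stated assumptions: 16-bit mono PCM at 16 kHz and a 100 ms chunk size are illustrative choices, not values documented for RASR, and iter_pcm_chunks is a hypothetical helper, not part of any HUAWEI CLOUD SDK.

```python
from typing import Iterator


def iter_pcm_chunks(
    pcm: bytes,
    sample_rate: int = 16000,  # assumed sample rate in Hz (illustrative)
    sample_width: int = 2,     # assumed bytes per sample (16-bit mono PCM)
    chunk_ms: int = 100,       # assumed chunk duration in milliseconds
) -> Iterator[bytes]:
    """Yield consecutive chunks of raw mono PCM, each covering chunk_ms of audio.

    The final chunk may be shorter if the audio length is not a multiple
    of the chunk size.
    """
    if sample_rate <= 0 or sample_width <= 0 or chunk_ms <= 0:
        raise ValueError("sample_rate, sample_width, and chunk_ms must be positive")

    # Whole samples per chunk, then converted to bytes.
    chunk_bytes = (sample_rate * chunk_ms // 1000) * sample_width
    if chunk_bytes == 0:
        raise ValueError("chunk_ms is too small for this sample rate")

    for start in range(0, len(pcm), chunk_bytes):
        yield pcm[start:start + chunk_bytes]
```

Each yielded chunk could then be passed to whatever streaming client your application uses. Smaller chunks reduce latency at the cost of more messages, so the chunk duration is worth tuning for your scenario.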

Application Scenarios
  • Live Subtitling

  • Real-Time Conference Recording

  • Instant Text Generation

Live Subtitling

Converts the audio from live video streams into subtitles in real time, improving the viewing experience and making content monitoring more convenient.

Advantages

  • Fast

    Real-time live speech recognition

  • Accurate

    High recognition accuracy

Real-Time Conference Recording

Converts the audio in a video or conference call into text in real time, and allows you to quickly verify, modify, and retrieve the text.

Advantages

  • Efficient

    Fast conference recording

  • Accurate

    High recognition accuracy

Instant Text Generation

Records your speech on mobile apps and converts it into text, simplifying subsequent text processing and archiving and saving manpower and time.

Advantages

  • Efficient and Convenient

    Fast recording of speech as text

  • Uninterrupted Recognition

    Supports uninterrupted recognition of voice data streams longer than 60 seconds.
