OCR Image to Text Function Description

Web Image OCR

Automatically recognizes all characters and their locations in web images, extracts contact information in the images, and provides the information for data mining and post-processing.

OCR Image to Text Application Scenarios

Electronic documentation archive

Recognizes structured information and extracts signatures and seals for quick review.

Express waybill filling

Recognizes contact information in images and generates express waybills automatically, eliminating the need for manual input.

Contract upload and review

Automatically recognizes structured information and extracts signatures and seals for quick review.

Introduction to OCR

Optical Character Recognition (OCR) detects and extracts text from images, scanned copies, PDFs, and OFD files, and converts the recognition results into editable text.

OCR provides open APIs, so you can use programming languages such as Python and Java to call OCR APIs to extract text from images. OCR allows you to automate the collection of key data. It helps you build an intelligent service system to improve efficiency. For details about how to obtain APIs, see Optical Character Recognition API Reference.

OCR also provides software development kits (SDKs) for multiple programming languages. For details about how to use SDKs, see Optical Character Recognition SDK Reference.

You can use either of the following methods to call OCR APIs after subscription:

A software tool such as curl or Postman

  1. These are good options if you are more comfortable writing code, HTTP requests, and API calls. For details about how to use the tool, see Using Postman to Call the Passport OCR API.

Software development kit (SDK)

  1. SDKs for Java, Python, and Node.js are available for quick integration. For details about how to use the tool, see Calling the Passport OCR SDK.

OCR FAQs

  • Can OCR Recognize Text Files?

    OCR detects and extracts text only in images and cannot directly extract text from files such as Word, PDF, and Excel. You can convert these files to images.

  • Do OCR SDKs Need to Be Purchased?

    No. OCR SDKs are free of charge.

  • How Do I Select a Region for an OCR Package?

    Resource packages in different regions are isolated. Select a region according to your business requirements. For details about the regions where services are deployed, see Endpoints.

    Determine the service region before purchasing a service package.

  • Can a Package Be Refunded After Being Purchased?

    The package cannot be refunded after being purchased.

  • Will User Data Be Saved When OCR Is Used?

    1. With trustedness as the core quality concept, OCR provides you with cutting-edge, future-facing, and trusted cloud services by meeting the requirements on security, compliance, privacy, resilience, and transparency.

    2. OCR promises that all the uploaded images and the recognition results are not retained in any form and will be deleted immediately after the recognition results are returned.

    3. For details about the statement, see Privacy Statement and Site Terms. For trusted resources, see the White Papers.


  • Do OCR SDKs Need Maven to Manage Dependency Packages?

    Some packages on which the Python SDK depends, such as requests, need to be installed in the local environment. If the Python SDK is connected to the external network, you can use the pip install + Package name to install the packages.

    Java SDK does not need Maven to manage local dependency packages and can be directly used.

  • What Do I Do If the Subscribe Button Is Dimmed?

    Symptom

    After logging in to the OCR console, I cannot subscribe to an OCR service because the Subscribe button is dimmed.

    Possible Causes

    The possible cause is that the current user is an IAM user who does not have the permission to subscribe to the service. An account pays for and owns the resources and has full access permissions for the resources.

    Solution

    Log in to the OCR console using an account and subscribe to OCR.

  • Why Is the Actual Number of API Calls Inconsistent with the Record Displayed on the Management Console?

    The OCR console only records the number of successful API calls. The number of failed API calls is not recorded.

    To view the number of failed calls, perform the following operations:

    1. Log in to the management console.

    2. On the console home, choose Optical Character Recognition. The Optical Character Recognition console is displayed.

    3. Click the target service. Click View Monitoring Graph to go to the Cloud Eye console and view detailed service usage such as the number of successful or failed API calls.

  • How Do I View the API Usage?

    1. Log in to the OCR console.

    2. In the navigation pane on the left, choose Service Monitoring, and view the API usage.

We use cookies to improve our site and your experience. By continuing to browse our site you accept our cookie policy. Find out more