Data Lake Insight

Data Lake Insight (DLI) is a fully-managed big data processing and analysis service based on Apache Spark. Without data migration, DLI provides you with insights from heterogeneous data of various cloud services by using SQL and Spark programs.

Pay per use, ¥1.4/hour for one compute unit

Learn more
  • You are advised to deploy DLI and  together to provide the one-stop IDE platform for data development.

Product Advantages
  • Ease of Use

    DLI frees you from having to manage infrastructure, delivering zero maintenance costs and second-level provisioning. It supports standard SQL and SparkSQL APIs.

  • Cross-Source Analysis

    You can query and explore data from various sources quickly, without having to load and convert the data. Multiple data formats and analytical dimensions are supported.

  • Multi-tenant

    Computing resources are isolated between tenants to meet job SLAs. Your data rights can be restricted to a specific table or column for more precise management.

  • Scalable

    Auto scaling of storage and computing resources allows you to query data without worrying about whether you have sufficient resources.

Application Scenarios
  • Multi-Dimensional Analysis

  • Heterogeneous Data Analysis

  • Historical Data Querying

  • Large-Scale Log Analysis

Multi-Dimensional Analysis

Multi-Dimensional Analysis

DLI displays the repeat purchase rate and conversion rate in BI reports. It gives reliable data support for better ad placement, operation decision-making, and product management in e-commerce.


Ultra-Low Cost

With DLI, you pay only for the queries that you run. You are charged ¥ 0.3 per GB data scanned by your queries.

Compatible with Mainstream Ecosystems

DLI is fully compatible with Apache Spark. You can customize processing tasks.

Flexible Preprocessing

You can adopt intelligent pre-aggregation to mine valuable data so that your data to be analyzed will be cut down to 1/10 or even 1/100 of original.

Heterogeneous Data Analysis

Heterogeneous Data Analysis

With DLI, you can perform correlation analysis on data on OBS, CloudTable, and RDS to implement your needs.


No Need for Data Migration

You can directly run combined queries by using SQL, instead of complex ETL operations.

Compatible with Mainstream Ecosystems

Compatible with Apache Spark, DLI allows you to customize complex processing tasks.

Support of Multiple Data Formats

DLI allows you to analyze data in raw formats, such as CSV, JSON, Parquet, and ORC, in OBS.

Historical Data Querying

IoT: Fleet Management

DLI can help manage diversified cargo fleets of millions of vehicles precisely and cost-effectively by analyzing driving behavior based on historical data and playing back transportation routes.


Huge Volumes of Data

GB- or even EB-scale data can be stored on OBS.

No Data Conversion

You can run SQL statements directly to analyze the historical data stored on OBS, instead of having to load or convert the data.

Various Data Formats

DLI works with various data formats, including TXT, CSV, JSON, Parquet, and ORC. Data stored on OBS can be analyzed without converting its format.

Related Services



Large-Scale Log Analysis

Online Education: Learning Behavior Analysis

DLI uses students' learning records stored on OBS to analyze their learning behavior. It helps teachers, parents, and students enhance students' learning efficiency based on the analysis results.


Low Cost

Huge volumes of log data can be stored on OBS at a low cost.

Easy to Use

You can use SQL statements to perform the analysis, instead of having to set up an extra system and editing complex programs.

Diversified Analytical Dimensions

You can query and analyze data from various aspects, such as the learning duration and associated knowledge points, facilitating innovative services.

Related Services



  • Low Skill Demands

    You can access DLI via various modes as well as use standard SQL to implement your needs.

  • OBS Data Querying

    You can query OBS data directly. DLI is an out-of-box service with flexibility.

Low Skill Demands

  • You can access DLI using the web console or through RESTful APIs, JDBC, or ODBC.

  • DLI supports the standard SQL2003 and is compatible with SparkSQL/HQL and TPC-H/TPC-DS.

OBS Data Querying

  • Storage and computing resources are separate. Data can be stored on OBS and queried using SQL statements.

  • DLI is ideal for quick, ad hoc querying. You can query data at any time and without preparing resources in advance or performing removal operations afterwards.

  • Secure and Available

    DLI complies with robust authentication systems and provides 99.999999999% data durability.

  • Multi-tenant

    DLI isolates resources between tenants and provides fine-grained data permission control.

Secure and Available

  • HUAWEI CLOUD has passed certification with the following security authentication systems:

    C-STAR, MIIT trusted cloud, the Office of the Central Leading Group for Cyberspace Affairs' cloud service cybersecurity evaluation, MPS level III cloud service classified protection, and ISO27001.

  • Security is ensured by user access authentication, encrypted internal service communication, cross-domain bidirectional authentication, and encrypted data transmission.


  • Resource queues can be applied for on a tenant-by-tenant basis. Enterprise tenants can specify queues for users in different departments to isolate computing resources and meet each department's SLA.

  • Enterprise administrators can control internal users' data permissions, such as those for creating and deleting tables or querying specific columns.

Create an Account and Experience HUAWEI CLOUD for Free

Register Now