Therefore, you must set LOCATION to an OBS path.
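A minimal sketch, assuming a Spark job that issues the DDL itself, of what setting LOCATION to an OBS path can look like; the database, table, columns, file format, and bucket path are hypothetical placeholders, not values from this document.

```java
import org.apache.spark.sql.SparkSession;

public class CreateObsTableExample {
    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .appName("create-obs-table")
                .getOrCreate();

        // LOCATION must point to an OBS path; database, table, columns,
        // format, and bucket path below are placeholders.
        spark.sql("CREATE TABLE IF NOT EXISTS demo_db.demo_obs_table ("
                + " id INT,"
                + " name STRING"
                + ") USING parquet"
                + " LOCATION 'obs://your-bucket/path/to/demo_obs_table'");

        spark.stop();
    }
}
```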
The options are as follows:
- jar: JAR file
- Pyfile: User Python file
- file: User file
- modelfile: User AI model file

obs_jar_paths: OBS path of the resource package. The parameter format is {bucketName}.{obs domain name}/{jarPath}/{jarName}.
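Purely for illustration, a hedged sketch of assembling a value in the documented obs_jar_paths format; the bucket name, OBS domain, path, and JAR name are made-up placeholders.

```java
public class ObsJarPathExample {
    public static void main(String[] args) {
        // Hypothetical values illustrating the documented format:
        // {bucketName}.{obs domain name}/{jarPath}/{jarName}
        String bucketName = "mybucket";
        String obsDomainName = "obs.example-region.example.com"; // placeholder domain
        String jarPath = "jars";
        String jarName = "flink-demo-1.0.jar";

        String obsJarPath = bucketName + "." + obsDomainName + "/" + jarPath + "/" + jarName;
        System.out.println(obsJarPath);
        // -> mybucket.obs.example-region.example.com/jars/flink-demo-1.0.jar
    }
}
```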
1. Configure an OBS bucket.
2. Enable checkpointing. Checkpointing must be enabled when using Hudi.
3. Submit the job and check the Flink UI and logs. Click Submit in the upper right corner of the page.
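A minimal Flink Jar sketch of enabling checkpointing programmatically, as required when using Hudi; the checkpoint interval, the exactly-once mode, and the placeholder pipeline are assumptions made for this sketch, not settings from this document.

```java
import org.apache.flink.streaming.api.CheckpointingMode;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

public class CheckpointingSketch {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

        // Checkpointing must be enabled when writing Hudi;
        // the 60-second interval is a placeholder, tune it for your job.
        env.enableCheckpointing(60_000L, CheckpointingMode.EXACTLY_ONCE);

        // Placeholder pipeline; in a real job this would be the source-to-Hudi flow.
        env.fromElements(1, 2, 3).print();

        env.execute("checkpointing-sketch");
    }
}
```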
Supported data sources:
- Cloud: OBS, RDS, GaussDB(DWS), CSS, MongoDB, and Redis
- On-premises: self-built databases, MongoDB, and Redis
- Cloud: OBS; On-premises: HDFS
Ecosystem compatibility: DLV, Yonghong BI, and Fanruan BI; big data ecosystem tools
Custom image: Supported.
View Log: Redirects to the OBS page where you can see the complete log archive addresses of the job, including commit logs, driver logs, and executor logs. You can download the logs here.
- How Do I Set Up AK/SK So That a General Queue Can Access Tables Stored in OBS?
- How Do I View the Resource Usage of DLI Spark Jobs?
- How Do I Use Python Scripts to Access the MySQL Database If the pymysql Module Is Missing from the Spark Job Results Stored in MySQL?
You can configure the spark.sql.shuffle.partitions parameter to control the number of files written to the OBS bucket when inserting data into a non-DLI table.
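A hedged sketch of setting spark.sql.shuffle.partitions before an insert; the value 16, the table names, and the GROUP BY query are placeholders, and the setting only influences the output file count when the insert plan actually shuffles data.

```java
import org.apache.spark.sql.SparkSession;

public class ShufflePartitionsSketch {
    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .appName("tune-obs-output-files")
                .getOrCreate();

        // Fewer shuffle partitions generally means fewer (larger) files written to
        // the table's OBS path when the insert plan contains a shuffle.
        spark.conf().set("spark.sql.shuffle.partitions", "16");

        // Placeholder tables; the GROUP BY forces a shuffle so the setting takes effect.
        spark.sql("INSERT INTO demo_db.demo_obs_table "
                + "SELECT id, name FROM demo_db.source_table GROUP BY id, name");

        spark.stop();
    }
}
```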
You can retain the default value. When loading the Hudi table, pass its OBS path to load(), for example .load("obs://bucket/to_your_table"); DLI supports only OBS paths. Then call dataFrame.show(100) to preview the data.
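A minimal end-to-end sketch of the read described above, assuming a Spark job with the Hudi bundle on the classpath; the application name is a placeholder and the obs://bucket/to_your_table path is the example path from the snippet, not a real location.

```java
import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;

public class ReadHudiFromObs {
    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .appName("read-hudi-from-obs")
                .getOrCreate();

        // Read the Hudi table directly from its OBS path; DLI supports only OBS paths.
        Dataset<Row> dataFrame = spark.read()
                .format("hudi")
                .load("obs://bucket/to_your_table");

        // Preview up to 100 rows.
        dataFrame.show(100);

        spark.stop();
    }
}
```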
Select Save Job Log, and specify the OBS bucket for saving job logs.
Possible values are as follows:
- JAR: JAR file
- PyFile: User Python file
- File: User file
- ModelFile: User AI model file
Example value: JAR.

OBS Path: Select the OBS path of the corresponding package.
NOTE: The program package must be uploaded to OBS in advance. Only files can be selected.
When this statement is used to drop a foreign table, the data in the OBS directory is not automatically deleted. When deleting an MOR table, the tables with the _rt and _ro suffixes are not automatically deleted. To delete them, you need to execute a DROP statement separately.
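A hedged sketch of the separate DROP statements mentioned above; the database and table names are hypothetical placeholders.

```java
import org.apache.spark.sql.SparkSession;

public class DropMorViewsSketch {
    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .appName("drop-mor-views")
                .getOrCreate();

        // Dropping the MOR table itself does not remove the companion tables
        // with the _rt and _ro suffixes, so drop them explicitly.
        spark.sql("DROP TABLE IF EXISTS demo_db.demo_mor_table_rt");
        spark.sql("DROP TABLE IF EXISTS demo_db.demo_mor_table_ro");

        spark.stop();
    }
}
```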
OBS Path: Specify the OBS path of the KafkaToKafka.properties file.
Group Name: Enter a name for a new group or select an existing group name.
Figure 6 Creating a DLI package
Create a Flink Jar job and run it.
Select Save Job Log and select an OBS bucket. If the bucket is not authorized, click Authorize. This allows job logs to be saved to your OBS bucket for fault locating after a job fails.
Upstream and downstream data connections: open-source connectors and out-of-the-box connectors for data sources including databases (RDS and GaussDB), message queues (DMS), data warehouses (GaussDB(DWS)), and object storage (OBS).
For details, see Creating an OBS Table or Creating a DLI Table.
How Do I Resolve an Unauthorized OBS Bucket Error?
For details about the OBS table permissions, see Table 2.
Figure 4 Granting OBS table permissions to a user
Figure 5 Granting OBS table permissions to a project
Table 2 Parameter description
Authorization Object: Select User or Project.
You can use it to create a table for storing OBS data.
Only OBS tables can be created, which means the table path must be configured through the LOCATION parameter. When using the metadata service provided by LakeFormation, both internal and external tables are supported.
Improving the performance of OBS Committer when writing small files
Improved the performance of Object Storage Service (OBS) when writing small files, increasing data transfer efficiency.