PrepAway - Latest Free Exam Questions & Answers

You need to ensure that the compute environment is created on-demand and removed when the process is completed

You are designing an Azure Data Factory pipeline for processing data. The pipeline will process data that is stored in general-purpose standard Azure storage.

You need to ensure that the compute environment is created on-demand and removed when the process is completed.

Which type of activity should you recommend?

A. Databricks Python activity
B. Data Lake Analytics U-SQL activity

C. HDInsight Pig activity

D. Databricks Jar activity

Explanation:
The HDInsight Pig activity in a Data Factory pipeline executes Pig queries on your own or on-demand HDInsight cluster.

References:
https://docs.microsoft.com/en-us/azure/data-factory/transform-data-using-hadoop-pig


Leave a Reply