Which four code segments should you use in sequence?
DRAG DROP
You have a text file named Data/examples/product.txt that contains product information.
You need to create a new Apache Hive table, import the product information to the table, and then read the top
100 rows of the table.
Which four code segments should you use in sequence? To answer, move the appropriate code segments
from the list of code segments to the answer area and arrange them in the correct order.
Select and Place:
Which method call should you view in the YARN logs?
You are configuring the Hive views on an Azure HDInsight cluster that is configured to use Kerberos.
You plan to use the YARN logs to troubleshoot a query that runs against Apache Hadoop.
You need to view the method, the service, and the authenticated account used to run the query.
Which method call should you view in the YARN logs?
Which two parameters should you modify?
You use YARN to manage the resources for a Spark Thrift Server running on a Linux-based Apache Spark
cluster in Azure HDInsight.
You discover that the cluster does not fully utilize the resources. You want to increase resource allocation.You need to increase the number of executors and the allocation of memory to the Spark Thrift Server driver.
Which two parameters should you modify? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
How should you complete the query?
DRAG DROP
You have an Apache Hive cluster in Azure HDInsight.
You need to tune a Hive query to meet the following requirements:
Use the Tez engine.
Process 1,024 rows in a batch.
How should you complete the query? To answer, drag the appropriate values to the correct targets. Each value
may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to
view content.
NOTE: Each correct selection is worth one point.
Select and Place:
What is the result of running the command?
You need to minimize data transfers during the join ope…
You have an Apache Spark cluster in Azure HDInsight.
You plan to join a large table and a lookup table.
You need to minimize data transfers during the join operation.
What should you do?
Which file formats can you use?
You plan to copy data from Azure Blob storage to an Azure SQL database by using Azure Data Factory.
Which file formats can you use?
What should you configure?
You have an Apache Hive table that contains one billion rows.
You plan to use queries that will filter the data by using the WHERE clause. The values of the columns will be
known only while the data loads into a Hive table.
You need to decrease the query runtime.
What should you configure?
How should you complete the command?
DRAG DROPYou have an Apache HBase cluster in Azure HDInsight. The cluster has a table named sales that contains a
column family named customerfamily.
You need to add a new column family named customeraddr to the sales table.
How should you complete the command? To answer, drag the appropriate values to the correct targets. Each
value may be used once, more than once, or not at all. You may need to drag the split bar between panes or
scroll to view content.
NOTE: Each correct selection is worth one point.
Select and Place:
What should you use to build the solution?
You have an Azure HDInsight cluster.
You need a build a solution to ingest real-time streaming data into a nonrelational distributed database.
What should you use to build the solution?