You are analyzing taxi trips in New York City. You leverage the Azure Data Factory to create data pipelines and
to orchestrate data movement.
You plan to develop a predictive model for 170 million rows (37 GB) of raw data in Apache Hive by using
Microsoft R Server to identify which factors contribute to the passenger tipping behavior.
All of the platforms that are used for the analysis are the same. Each worker node has eight processor cores
and 26 GB of memory.
Which type of Azure HDInsight cluster should you use to produce results as quickly as possible?

A.
Hadoop
B.
HBase
C.
Interactive Hive
D.
Spark
Explanation:
https://azure.microsoft.com/en-gb/blog/general-availability-of-hdinsight-interactive-query-blazing-fast-datawarehouse-style-queries-on-hyper-scale-data-2/
Interactive Query leverages [Hive on LLAP] in Apache Hive 2.1, brings the interactivity to your complex data warehouse style queries on large datasets stored on commodity cloud storage.
Today, we announce the general availability of the Interactive Query cluster type in Azure HDInsight (formerly known as Interactive Hive).
1
0