PrepAway - Latest Free Exam Questions & Answers

you need to do in order to run Impala on the cluster and submit jobs from the command line of the gateway mach

You have a Hadoop cluster HDFS, and a gateway machine external to the cluster from which
clients submit jobs. What do you need to do in order to run Impala on the cluster and submit jobs
from the command line of the gateway machine?

PrepAway - Latest Free Exam Questions & Answers

A.
Install the impalad daemon statestored daemon, and daemon on each machine in the cluster,
and the impala shell on your gateway machine

B.
Install the impalad daemon, the statestored daemon, the catalogd daemon, and the impala
shell on your gateway machine

C.
Install the impalad daemon and the impala shell on your gateway machine, and the statestored
daemon and catalogd daemon on one of the nodes in the cluster

D.
Install the impalad daemon on each machine in the cluster, the statestored daemon and
catalogd daemon on one machine in the cluster, and the impala shell on your gateway machine

E.
Install the impalad daemon, statestored daemon, and catalogd daemon on each machine in the
cluster and on the gateway node

8 Comments on “you need to do in order to run Impala on the cluster and submit jobs from the command line of the gateway mach

  1. Matt says:

    Impala daemon needs to be installed in all data nodes , because the any one arbitary node will receive the request and split the job across all the impala daemon , that is why impala is fast over hive processing




    0



    0
  2. ashfaque says:

    C is correct one….while configuaring cluster using(CDH5 or later) you just need to configure imalad daemon on gateway node(access node/edge node) you will be able to access imapla shell from that gateway node and statestored daemon and catalogd daemon on one of the nodes in the cluster it is not recommanded to configure impalad daemon on each nodes of cluster.




    0



    0
  3. Manoj Sekharan says:

    D is the right answer. Following are the main components of impala.

    catalogd—> The metadata coordination (any one machine)
    Statestored–>Nameservice that tracks location of impalad services in the cluster(anyone machine)
    impalad–> on ecah machine in your cluster.(all datanodes)
    Imapala shell–> on the gateway node.




    0



    0

Leave a Reply

Your email address will not be published. Required fields are marked *