What can you determine from the lift calculation?
You are using the Apriori algorithm to determine the likelihood that a person who owns a home
has a good credit score. You have determined that the confidence for the rules used in the
algorithm is > 75%. You calculate lift = 1.011 for the rule, “People with good credit are
homeowners”. What can you determine from the lift calculation?
Which rule has a confidence equal to 50%?
Consider a database with 4 transactions:
Transaction 1: {cheese, bread, milk}
Transaction 2: {soda, bread, milk}
Transaction 3: {cheese, bread}
Transaction 4: {cheese, soda, juice}
The minimum support is 25%. Which rule has a confidence equal to 50%?
you need to implement N-fold cross-validation after creating a regression model?
Under which circumstance do you need to implement N-fold cross-validation after creating a
regression model?
What is an appropriate data visualization to use in a presentation for an analyst audience?
What is an appropriate data visualization to use in a presentation for an analyst audience?
When would you use GROUP BY ROLLUP clause in your OLAP query?
When would you use GROUP BY ROLLUP clause in your OLAP query?
Which type of numeric value does a logistic regression model estimate?
Which type of numeric value does a logistic regression model estimate?
Which query interface would you recommend?
Your colleague, who is new to Hadoop, approaches you with a question. They want to know how
best to access their data. This colleague has a strong background in data flow languages and
programming.
Which query interface would you recommend?
Which tool should they use?
The web analytics team uses Hadoop to process access logs. They now want to correlate this
data with structured user data residing in a production single-instance JDBC database. They
collaborate with the production team to import the data into Hadoop. Which tool should they use?
What does the R code z <- f[1:10, ] do?
What does the R code
z <- f[1:10, ]
do?
In R, functions like plot() and hist() are known as what?
In R, functions like plot() and hist() are known as what?