What can you determine from the lift calculation?
You are using the Apriori algorithm to determine the likelihood that a person who owns a home has a good
credit score. You have determined that the confidence for the rules used in the algorithm is > 75%. You
calculate lift = 1.011 for the rule, “People with good credit are homeowners”. What can you determine from the
lift calculation?
Which rule has a confidence equal to 50%?
Consider a database with 4 transactions:
Transaction 1: {cheese, bread, milk}
Transaction 2: {soda, bread, milk}
Transaction 3: {cheese, bread}
Transaction 4: {cheese, soda, juice}
The minimum support is 25%. Which rule has a confidence equal to 50%?
you need to implement N-fold cross-validation after creating a regression model?
Under which circumstance do you need to implement N-fold cross-validation after creating a regression
model?
What is an appropriate data visualization to use ina presentation for an analyst audience?
What is an appropriate data visualization to use ina presentation for an analyst audience?
When would you use GROUP BY ROLLUP clause in your OLAP query?
When would you use GROUP BY ROLLUP clause in your OLAP query?
Which type of numeric value does a logistic regression model estimate?
Which type of numeric value does a logistic regression model estimate?
Which query interface would you recommend?
Your colleague, who is new to Hadoop, approaches you with a question. They want to know how best to
access their data. This colleague has a strong background in data flow languages and programming.
Which query interface would you recommend?
Which tool should they use?
The web analytics team uses Hadoop to process access logs. They now want to correlate this data with
structured user data residing in a production single-instance JDBC database. They collaborate with the
production team to import the data into Hadoop. Which tool should they use?
What does the R code do?
What does the R code do?
z <- f[1:10, ]
In R, functions like plot() and hist() are known aswhat?
In R, functions like plot() and hist() are known aswhat?