Which tool would you recommend to this colleague?
During a discussion, a colleague mentions that they want to perform K-means clustering on text file data stored in HDFS.
Which tool would you recommend to this colleague?
What should you deliver to the production team, along with your commented code?
You have completed your model and are handing it off to be deployed in production. What should you deliver to the production team, along with your commented code?
In the MapReduce framework, what is the purpose of the Map Function?
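As background for this question, the following is a minimal single-process sketch of the MapReduce pattern applied to word counting. It is not Hadoop code; the names `map_words`, `shuffle`, and `reduce_counts` and the sample lines are hypothetical, chosen only for illustration. It shows the map step turning each input record into intermediate (key, value) pairs, which are then grouped by key and reduced.

```python
from collections import defaultdict

def map_words(line):
    """Map step: turn one input record into zero or more (key, value) pairs."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    """Group intermediate values by key (done by the framework in a real cluster)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_counts(key, values):
    """Reduce step: combine all values that share a key into one result."""
    return key, sum(values)

# Hypothetical input records, one per line of a text file.
lines = ["hat scarf coat", "hat gloves"]

mapped = [pair for line in lines for pair in map_words(line)]
counts = dict(reduce_counts(k, v) for k, v in shuffle(mapped).items())
print(mapped)
print(counts)
```

In this sketch the map function never sees more than one record at a time, which is what lets a real framework run many map tasks in parallel.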
What is the confidence of the rule (hat, scarf) -> gloves?
Consider these itemsets:
(hat, scarf, coat)
(hat, scarf, coat, gloves)
(hat, scarf, gloves)
(hat, gloves)
(scarf, coat, gloves)
What is the confidence of the rule (hat, scarf) -> gloves?
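The confidence of a rule X -> Y is the number of itemsets containing both X and Y divided by the number containing X. A short sketch of that arithmetic on the five itemsets above (the helper names `support_count` and `confidence` are illustrative, not from any particular library):

```python
# The five itemsets from the question, as Python sets.
transactions = [
    {"hat", "scarf", "coat"},
    {"hat", "scarf", "coat", "gloves"},
    {"hat", "scarf", "gloves"},
    {"hat", "gloves"},
    {"scarf", "coat", "gloves"},
]

def support_count(itemset, transactions):
    """Count the transactions that contain every item in `itemset`."""
    return sum(1 for t in transactions if itemset <= t)

def confidence(antecedent, consequent, transactions):
    """confidence(X -> Y) = count(X and Y together) / count(X)."""
    both = support_count(antecedent | consequent, transactions)
    base = support_count(antecedent, transactions)
    return both / base

conf = confidence({"hat", "scarf"}, {"gloves"}, transactions)
print(round(conf, 3))  # 0.667
```

Three itemsets contain both hat and scarf, and two of those also contain gloves, so the ratio is 2/3.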
How are window functions different from regular aggregate functions?
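To make the contrast concrete, here is a small sketch using Python's built-in `sqlite3` module. It assumes the bundled SQLite is version 3.25 or later, which is when window functions were added; the `sales` table and its rows are made up for illustration. The same `SUM` is run once with `GROUP BY` and once with an `OVER` clause.

```python
import sqlite3

# In-memory database with a small hypothetical table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("east", 10), ("east", 30), ("west", 5)],
)

# Regular aggregate: one output row per group.
aggregate_rows = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()

# Window function: every input row is kept, with the group total attached.
window_rows = conn.execute(
    "SELECT region, amount, SUM(amount) OVER (PARTITION BY region) "
    "FROM sales ORDER BY region, amount"
).fetchall()

print(aggregate_rows)
print(window_rows)
conn.close()
```

The grouped query returns two rows for the three inputs, while the windowed query returns three rows, each carrying its region's total alongside the original amount.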
Which method should the data scientist try first?
A data scientist is asked to implement an article recommendation feature for an online magazine.
The magazine does not want to use client tracking technologies such as cookies or reading history. Therefore, only the style and subject matter of the current article are available for making recommendations. All of the magazine’s articles are stored in a database in a format suitable for analytics.
Which method should the data scientist try first?
What is your next step?
You have been assigned to study the effect of a pricing model for online transactions on daily revenue. You tested all the theoretical models in the previous model planning stage, and all tests yielded statistically insignificant results. What is your next step?
Which statement describes a true property of the logistic regression method?
Which word or phrase completes the statement? Structured data is to OLAP data as quasi-structured data is to ____
Which algorithm is the most appropriate for building a predictive model for subscribers?
You are asked to create a model to predict the total number of monthly subscribers for a specific magazine. You are provided with 1 year’s worth of subscription and payment data, user demographic data, and 10 years’ worth of the magazine’s content (articles and pictures). Which algorithm is the most appropriate for building a predictive model for subscribers?