DEA-7TT2 無料問題集「EMC Associate - Data Science and Big Data Analytics v2」

You are using MADlib for Linear Regression analysis. Which value does the statement return?
SELECT (linregr(depvar, indepvar)).r2 FROM zeta1;
Response:

During a study to understand the population growth of a certain bacterial culture, you plot the data and identify a quadratic growth trend over time. Which transformation should you apply to linearize the data?
Response:

Under which circumstance do you need to implement N-fold cross-validation after creating a regression model?
Response:

What is the primary function of the NameNode in Hadoop?
Response:

What is the difference between the array and list data structures in R?
Response:

The average purchase size from your online sales site is $17, 200. The customer experience team believes a certain adjustment of the website will increase sales.
A pilot study on a few hundred customers showed an increase in average purchase size of $1.47, with a significance level of p=0.1. The team runs a larger study, of a few thousand customers. The second study shows an increased average purchase size of $0.74, with a significance level of 0.03.
What is your assessment of this study?
Response:

You have two tables of customers in your database. Customers in cust_table_1 were sent an e-mail promotion last year, and customers in cust_table_2 received a newsletter last year.
Customers can only be entered in once per table. You want to create a table that includes all customers, and any of the communications they received last year.
Which type of join would you use for this table?
Response:

What does R code nv <- v[v < 1000] do?
Response:

Data has been collected on visitors' viewing habits at a bank's website. Which technique is used to identify pages commonly viewed during the same visit to the website?
Response:

Which word or phrase completes the statement? Structured data is to OLAP data as quasi- structured data is to Response:

In logistic regression modeling, what is the commonly assigned probability threshold used to assign a class label?
Response:

Assume that you have a data frame in R. Which function would you use to display descriptive statistics about this variable?
Response:

Consider this SQL statement:
SELECT product, prod_cost, avg(prod_cost) OVER (PARTITION BY product)
FROM product_detail
The OVER clause makes this what type of function?
Response:

Which data asset is an example of semi-structured data?
Response:

What is required in a presentation for business analysts?
Response:

You are using the Apriori algorithm to determine the likelihood that a person who owns a home has a good credit score. You have determined that the confidence for the rules used in the algorithm is > 75%. You calculate lift = 1.011 for the rule, "People with good credit are homeowners".
What can you determine from the lift calculation?
Response:

In addition to less data movement and the ability to use larger datasets in calculations, what is a benefit of analytical calculations in a database?
Response:

弊社を連絡する

我々は12時間以内ですべてのお問い合わせを答えます。

オンラインサポート時間:( UTC+9 ) 9:00-24:00
月曜日から土曜日まで

サポート:現在連絡