DEA-7TT2無料問題集「EMC Associate - Data Science and Big Data Analytics v2」

質問 1

You are using MADlib for Linear Regression analysis. Which value does the statement return?
SELECT (linregr(depvar, indepvar)).r2 FROM zeta1;
Response:

（A）Standard error

（B）P-value

（C）Coefficients

（D）Goodness of fit

正解：D 解答を投票する

質問 2

During a study to understand the population growth of a certain bacterial culture, you plot the data and identify a quadratic growth trend over time. Which transformation should you apply to linearize the data?
Response:

（A）Square root

（B）Add a constant

（C）Square

（D）Cube

正解：A 解答を投票する

質問 3

Under which circumstance do you need to implement N-fold cross-validation after creating a regression model?
Response:

（A）There are missing values in the data.

（B）The data is unformatted.

（C）There are categorical variables in the model.

（D）There is not enough data to create a test set.

正解：D 解答を投票する

質問 4

What is the primary function of the NameNode in Hadoop?
Response:

（A）Monitors the state of each JobTracker node and signals an event if unavailable

（B）Keeps track of which MapReduce jobs have been assigned to each TaskTracker

（C）Runs some number of mapping tasks against its assigned data

（D）Acts as a regulator/resolver among clients and DataNodes

正解：D 解答を投票する

質問 5

What is the difference between the array and list data structures in R?
Response:

（A）Arrays are N-dimensional;Lists are only 2-dimensional

（B）Arrays contain only the same data type;Lists can contain different data types

（C）Arrays can contain different data types;Lists can contain only the same data type

（D）Arrays are only 2-dimensional;Lists are N-dimensional

正解：B 解答を投票する

質問 6

The average purchase size from your online sales site is $17, 200. The customer experience team believes a certain adjustment of the website will increase sales.
A pilot study on a few hundred customers showed an increase in average purchase size of $1.47, with a significance level of p=0.1. The team runs a larger study, of a few thousand customers. The second study shows an increased average purchase size of $0.74, with a significance level of 0.03.
What is your assessment of this study?
Response:

（A）The change in purchase size is not practically important, and the good p-value of the second study is probably a result of the large study size.

（B）The change in purchase size is small, but may aggregate up to a large increase in profits over the entire customer base.

（C）The difference in the change in purchase size between the two studies is troubling; The team should run another, larger study.

（D）The p-value of the second study shows a statistically significant change in purchase size. The new website is an improvement.

正解：A 解答を投票する

質問 7

You have two tables of customers in your database. Customers in cust_table_1 were sent an e-mail promotion last year, and customers in cust_table_2 received a newsletter last year.
Customers can only be entered in once per table. You want to create a table that includes all customers, and any of the communications they received last year.
Which type of join would you use for this table?
Response:

（A）Full outer join

（B）Left outer join

（C）Inner join

（D）Cross join

正解：A 解答を投票する

質問 8

What does R code nv <- v[v < 1000] do?
Response:

（A）Sets nv to TRUE or FALSE depending on whether all elements of vector v are less than 1000

（B）Selects values of vector v less than 1000, modifies v, and makes a copy to nv

（C）Removes elements of vector v less than 1000 and assigns the elements >= 1000 to nv

（D）Selects the values in vector v that are less than 1000 and assigns them to the vector nv

正解：D 解答を投票する

質問 9

Data has been collected on visitors' viewing habits at a bank's website. Which technique is used to identify pages commonly viewed during the same visit to the website?
Response:

（A）Association Rules

（B）Classification

（C）Clustering

（D）Regression

正解：A 解答を投票する

質問 10

Which word or phrase completes the statement? Structured data is to OLAP data as quasi- structured data is to Response:

（A）Image files

（B）XML data

（C）Clickstream data

（D）Text documents

正解：C 解答を投票する

質問 11

In logistic regression modeling, what is the commonly assigned probability threshold used to assign a class label?
Response:

（A）0.25

（B）0.9

（C）0.5

（D）0.1

正解：C 解答を投票する

質問 12

Assume that you have a data frame in R. Which function would you use to display descriptive statistics about this variable?
Response:

（A）str

（B）levels

（C）summary

（D）attributes

正解：C 解答を投票する

質問 13

Consider this SQL statement:
SELECT product, prod_cost, avg(prod_cost) OVER (PARTITION BY product)
FROM product_detail
The OVER clause makes this what type of function?
Response:

（A）Window function

（B）System function

（C）User-defined function

（D）Aggregate function

正解：A 解答を投票する

質問 14

Which data asset is an example of semi-structured data?
Response:

（A）Webserver log

（B）XML data file

（C）Database table

（D）News article

正解：B 解答を投票する

質問 15

What is required in a presentation for business analysts?
Response:

（A）The presentation author,s credentials

（B）Detailed statistical explanation of the applicable modeling theory

（C）Budgetary considerations and requests

（D）Operational process changes

正解：D 解答を投票する

質問 16

You are using the Apriori algorithm to determine the likelihood that a person who owns a home has a good credit score. You have determined that the confidence for the rules used in the algorithm is > 75%. You calculate lift = 1.011 for the rule, "People with good credit are homeowners".
What can you determine from the lift calculation?
Response:

（A）The rule is true

（B）Leverage of the rules is low

（C）The rule is coincidental

（D）Support for the association is low

正解：C 解答を投票する

質問 17

In addition to less data movement and the ability to use larger datasets in calculations, what is a benefit of analytical calculations in a database?
Response:

（A）more efficient handling of categorical values

（B）improved connections between disparate data sources

（C）full use of data aggregation functionality

（D）quicker time to insight

正解：D 解答を投票する

DEA-7TT2 無料問題集「EMC Associate - Data Science and Big Data Analytics v2」

弊社を連絡する

関連リンク

トップ試験