D-DS-FN-23 無料問題集「EMC Dell Data Science Foundations」

A data scientist plans to classify the sentiment polarity of 10, 000 product reviews collected from the Internet.
What is the most appropriate model to use? Suppose labeled training data is available.

MapReduce is designed to process data in which way?

解説: (JPNTest メンバーにのみ表示されます)
Which word or phrase completes the statement? A Data Scientist would consider that a RDBMS is to a Table as R is to a ________.

What is a business driver for Big Data analytics adoption?

解説: (JPNTest メンバーにのみ表示されます)
Refer to the exhibit.

You are assigned to do an end of the year sales analysis of 1, 000 different products, based on the transaction table.
Which column in the end of year report requires the use of a window function?

Consider a database with the following four transactions:
Transaction 1: {cheese, bread, milk}
Transaction 2: {soda, bread, milk}
Transaction 3: {cheese, bread}
Transaction 4: {cheese, soda, juice}
Assuming a minimum support of 50%, which itemsets would be generated in the second step of the Apriori algorithm?

A logistic regression model is built to determine the probability of a credit card borrower defaulting on a credit loan. A threshold value of 0.3 is selected. Which statement can be used to predict a borrower will default?

解説: (JPNTest メンバーにのみ表示されます)
Which SQL OLAP grouping extension returns a result for each output row with 1 identifying a summary row and 0 identifying grouped rows?

解説: (JPNTest メンバーにのみ表示されます)
Consider the example of an analysis for fraud detection on credit card usage. You will need to ensure higher-risk transactions that may indicate fraudulent credit card activity are retained in your data for analysis, and not dropped as outliers during pre-processing.
What will be your approach for loading data into the analytical sandbox for this analysis?

You have plotted the distribution of savings account sizes for a bank.

Based on the distribution shown in the exhibit, how would you proceed?

In time series analysis, what is an indication of a stationary sequence?

What does the Receiver Operating Characteristic (ROC) curve show?

Which data asset is an example of quasi-structured data?

Consider the following itemsets:
(hat, scarf, coat)
(hat, scarf, coat, gloves)
(hat, scarf, gloves)
(hat, gloves)
(scarf, coat, gloves)
If the minimum support is 50%, what represents the complete list of frequent 2-itemsets?

Your organization has a website where visitors randomly receive one of two coupons. It is also possible that visitors to the website will not receive a coupon.
You have been asked to determine if offering a coupon to visitors to your website has any impact on their purchase decision.
Which analysis method should you use?

Data visualization is used in the final presentation of an analytics project.
For what else is this technique commonly used?

You are using k-means clustering to classify heart patients for a hospital. You have chosen Patient Sex, Height, Weight, Age and Income as measures and have used 3 clusters.
When you create a pair-wise plot of the clusters, you notice that there is significant overlap between the clusters.
What should you do?

弊社を連絡する

我々は12時間以内ですべてのお問い合わせを答えます。

オンラインサポート時間:( UTC+9 ) 9:00-24:00
月曜日から土曜日まで

サポート:現在連絡