DEA-7TT2 Free Practice Questions: "EMC Associate - Data Science and Big Data Analytics v2"

A call center for a large electronics company handles an average of 35,000 support calls a day. Because of recent customer complaints about long wait times, the head of the call center would like to optimize staffing during the rollout of a new product.
You have been asked to create a model to optimize call center costs and customer wait times. The goals for this project include:
1. Relative to the release of a product, how does the call volume change over time?
2. How to best optimize staffing based on the call volume for the newly released product, relative to old products.
3. Historically, what time of day does the call center need to be most heavily staffed?
4. Determine the frequency of calls by both product type and customer language.
Which goals are suitable to be completed with MapReduce?
Response:
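
As an illustration of the kind of computation described in goal 4, here is a minimal Python sketch of a MapReduce-style frequency count. The call records, the field names, and the `map_phase` / `reduce_phase` helpers are hypothetical; this runs in a single process and only mimics the map and reduce steps, it is not a Hadoop job.

```python
from collections import defaultdict

# Hypothetical call records: (product_type, customer_language).
calls = [
    ("phone", "en"),
    ("phone", "es"),
    ("tablet", "en"),
    ("phone", "en"),
]

def map_phase(records):
    """Emit a ((product, language), 1) pair for every call record."""
    for product, language in records:
        yield (product, language), 1

def reduce_phase(pairs):
    """Sum the emitted counts for each (product, language) key."""
    totals = defaultdict(int)
    for key, count in pairs:
        totals[key] += count
    return dict(totals)

frequencies = reduce_phase(map_phase(calls))
print(frequencies)
```

Each record is handled independently in the map step, and the reduce step only needs the pairs that share a key, which is what makes this kind of count easy to distribute.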

Refer to the exhibit.

You are asked to write a report on how specific variables impact your client's sales using a data set provided to you by the client. The data includes 15 variables that the client views as directly related to sales, and you are restricted to these variables only.
After a preliminary analysis of the data, the following findings were made:
1. Multicollinearity is not an issue among the variables
2. Only three variables, A, B, and C, have significant correlation with sales
You build a linear regression model on the dependent variable of sales with the independent variables of A, B, and C. The results of the regression are seen in the exhibit. You cannot request additional data.
What is a way that you could try to increase the R-squared of the model without artificially inflating it?
Response:

Which word or phrase completes the statement? Data-ink ratio is to data visualization as _________.
Response:

You are using k-means clustering to classify heart patients for a hospital. You have chosen Patient Sex, Height, Weight, Age and Income as measures and have used 3 clusters.
When you create a pair-wise plot of the clusters, you notice that there is significant overlap between the clusters. What should you do?
Response:

Which characteristic applies only to Business Intelligence as opposed to Data Science?
Response:

The average purchase size from your online sales site is $17,200. The customer experience team believes a certain adjustment of the website will increase sales.
A pilot study on a few hundred customers showed an increase in average purchase size of $1.47, with a significance level of p=0.1. The team runs a larger study, of a few thousand customers. The second study shows an increased average purchase size of $0.74, with a significance level of 0.03.
What is your assessment of this study?
Response:

You have fit a decision tree classifier using 12 input variables. The resulting tree used 7 of the 12 variables, and is 5 levels deep. Some of the nodes contain only 3 data points. The AUC of the model is 0.85.
What is your evaluation of this model?
Response:

How are window functions different from regular aggregate functions?
Response:
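
To make the two kinds of function concrete, the following sketch runs both against the same rows using Python's built-in `sqlite3` module. The `sales` table and its values are invented for illustration, and the `OVER (PARTITION BY ...)` clause assumes the bundled SQLite library is version 3.25 or later.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("east", 10), ("east", 20), ("west", 5)],  # hypothetical data
)

# Regular aggregate: one output row per group.
aggregate_rows = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()

# Window function: every input row is kept and annotated with its group total.
window_rows = conn.execute(
    "SELECT region, amount, SUM(amount) OVER (PARTITION BY region) "
    "FROM sales ORDER BY region, amount"
).fetchall()

print(aggregate_rows)
print(window_rows)
```

The aggregate query returns two rows for three input rows, while the window query returns all three rows with the regional total attached to each.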

In addition to less data movement and the ability to use larger datasets in calculations, what is a benefit of analytical calculations in a database?
Response:

What does a leaf node represent in a decision tree?
Response:

You have scored your Naive Bayesian Classifier model on "hold out" test data for cross validation. You have determined the way the samples scored and have tabulated them as shown in the exhibit.

What are the Precision and Recall rates of the model?
Response:
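
The exhibit is not reproduced here, so the sketch below uses hypothetical confusion-matrix counts only to show how the two rates are computed: precision is TP / (TP + FP) and recall is TP / (TP + FN). The function name and the counts are assumptions for illustration.

```python
def precision_recall(tp, fp, fn):
    """Return (precision, recall) from confusion-matrix counts."""
    precision = tp / (tp + fp)  # share of predicted positives that are correct
    recall = tp / (tp + fn)     # share of actual positives that were found
    return precision, recall

# Hypothetical counts, NOT taken from the exhibit.
precision, recall = precision_recall(tp=40, fp=10, fn=20)
print(precision, recall)
```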

Consider a scale that has five (5) values that range from "not important" to "very important". Which data classification best describes this data?
Response:

Refer to the exhibit.

The exhibit shows four graphs labeled Fig A through Fig D. Which figure represents the entropy function relative to a Boolean classification, as given by the formula shown in the exhibit?
Response:
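
The exhibit's formula is not reproduced here. Assuming it is the standard entropy of a two-class (Boolean) outcome, H(p) = -p log2(p) - (1 - p) log2(1 - p), a short Python sketch shows how the function behaves at a few points. The helper name `binary_entropy` is hypothetical.

```python
import math

def binary_entropy(p):
    """Entropy in bits of a Boolean outcome with P(true) = p."""
    if p in (0.0, 1.0):
        return 0.0  # by convention, 0 * log2(0) is taken as 0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

for p in (0.0, 0.1, 0.5, 0.9, 1.0):
    print(p, round(binary_entropy(p), 3))
```

Under this definition the value is zero when the outcome is certain, peaks at one bit when both classes are equally likely, and is symmetric around p = 0.5.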

Refer to the exhibit.

You have run a linear regression model against your data, and have plotted true outcome versus predicted outcome. The R-squared of your model is 0.75. What is your assessment of the model?
Response:
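
For reference, R-squared can be computed from true and predicted outcomes as 1 - SS_res / SS_tot. The sketch below uses made-up values rather than the data behind the exhibit, and the function name is an assumption.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_tot

# Hypothetical values for illustration only.
actual = [1.0, 2.0, 3.0, 4.0]
predicted = [1.0, 2.0, 3.0, 5.0]
score = r_squared(actual, predicted)
print(score)
```

Because R-squared is a single summary number, it says nothing about where the errors fall, which is why a plot of true versus predicted outcome is usually inspected alongside it.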

Which word or phrase completes the statement: "A data scientist would consider that an RDBMS is to a table as R is to a _____."?
Response:

In addition to quantitative and technical skills, what is a key aspect of the profile of a data scientist?
Response:

Which word or phrase completes the statement: "Excessive emphasis color is to bar chart as __________________."?
Response:

What is a consideration when building decision trees?
Response:
