OSCM 4250 TUT Supply Chain Variable Relationship & Variance of Clusters Exam Practice

User Generated

Zbun14098023

Business Finance

OSCM 4250

The University of Toledo

OSCM

Description

  • We want to visually compare the distributions of the blood types for males and females, which of the following is suitable? (data format see below)
  • Based on the dendrogram, what is the number of clusters that works better?
  • Based on the box plot, which of the following statement is true?
  • We want to remove outliers before performing one-sample t-test, which of the following are valid procedures?
  • To check if a linear correlation exists between two scale variables, we can run the Pearson correlation directly. We do not have to draw the scatter chart if we do not need visualization in the report. True Or False
  • The dataset has 20,000 observations, Hence, we use the following to test normality
  • Which of the following statement about Spearman’s Rank Order Correlation Test is not correct?
  • We can use Q-Q plot to compare our data distribution against a Poisson distribution. True Or False
  • When measuring the similarity between clusters, which one should be chosen if we want to measure the shortest distance between clusters
  • 11- When clustering observations with only binary variables, we can calculate the similarity based on which of the follows to reduce the effect of the absence of feature.
  • 17- Download the file:ex2.sav.
  • 18- Download the file: ex3.sav .
  • 19- Download the file: ex1.sav

Gender

ABO

M

A

M

O

F

B

F

O

M

A

A.

Cross-tab

B.

Histogram

C.

Box Plot

D.

Line Chart

2-How many samples should we collect for the following scenario? We want to know the proportion of students who were sick at least one time during the semester with the level of precision (5%) and 95 percent confidence level (i.e., the corresponding z=1.96) assuming the estimated proportion is 0.5. Round up your answer to the integer.

  • 3
  • 4
  • 5
  • 6

a)

The central line does not exist for the female group because there is no sufficient data to determine it

b)

The test distribution of the male is positively skewed

c)

The mean test score for the male is 2

d.

The IQR for the female is 5

a.Box plot, z-score, and scatter chart

b.Box plot and z-score

c.Standardized variable and scatter plot

d.Box plot

a.Kolmogorov-Smirnov Test

b.Shapiro-Wilk Test

c.Q-Q Plot

d.All of the above are suitable

a.If Pearson correlation shows a positive correlation, Spearman’s test will show a positive correlation too

b.It can also be applied to the ordinal categorical variable but not the nominal variable

c.It can be used when the two variables to be tested are not normally distributed. However, they still need to have a linear relationship

d.All of the above is incorrect

a.Single linkage

b.Complete linkage

c.Group Average linkage

d.Centroid linkage


a.Squared Euclidean Distance

b.Euclidean Distance

c.Jaccard's Coefficient

d.Matching Coefficient

12-When we run the one-sample t-test to check if the population mean of income is equal to 50000. Suppose that the 95% C.I. of difference provided on the output table is [500, 900], which of the follows is likely to be the correct p-value?

a.0.035

b.0.065

c.0.535

d.All are possible

13-We want to find out if the average commute times are different in the residents' zip codes (43615, 48103, and 48105), which of the following is more suitable given that the commute time is normally distributed within each zip code

a.Paired-samples t-test

b.One-way ANOVA

c.Two-way ANOVA

d.Independent-sample t-test

14-The output for the paired t-test is provided below. Our H1is that X7 and X4 are different. Based on the output, which of the following is the correct conclusion

a.There is no significant evidence to show that X7 is different from X4 (p=0.076)

b.We conclude that X7 is significantly different from X4 (p=0.038)

c.Neither is correct

15-Based on the result below, we should report the t-test results from

a.The first row

b.The second row

c.Either is ok

16- Download the file: ex1.sav .

Run the Kolmogorov-Smirnov Normality Test on X1 for each gender, what is the significant level for males?

a.0.097

b.0.179

c.0.035

d.0.024

Run a one-sample t-test to see if the mean for X3 is different from 140. Report the t statistic.

a.0.202

b.0.830

c.0.513

d.0.246

Aggregate the data by Gender. What is the standard deviation of X2 for males? (enter two decimals point e.g., 35.24)



Report the Spearman's correlation between X1 and X3 (report the exact value. Do not round.)

20-We plan to run Mann-Whitney Test to compare the savings for males and females. Check the assumptions for the attached data. Based on your finding, the Mann-Whitney Test will compare:


Data:PP.sav

a.mean

b.mode

c.variance

d.median

21-We want to run a nonparametric test to compare the saving distributions for residents in Perrysburg and Ottawa Hills. Select the appropriate test taught in class and report the p-value.

ind_t.sav

22-Calculate the mean for X1 (format:0.00) after removing or filtering out all observations that are two times away from the standard deviation.

mean.sav

23-Run the K-mean cluster to divide observations into 2 groups based on age and height. What is the average age for the group with C in it? (format: x.xx)

cluster.sav

24-Answer the following question based on your K-mean cluster analysis in the previous question. Which of the following is true? (choose any method you like in SPSS)

a.The cluster that includes C is considered older and taller.

b.The cluster that includes C is considered older and shorter.

c.The cluster that includes C is considered younger and taller.

d.The cluster that includes C is considered younger and shorter.

Pearson correlation can also help us determine how fast one variable can be affected by the other one.True or

Unformatted Attachment Preview

1- We want to visually compare the distributions of the blood types for males and females, which of the following is suitable? (data format see below) Gender M M F F M ABO A O B O A A. Cross-tab B. Histogram C. Box Plot D. Line Chart 2- How many samples should we collect for the following scenario? We want to know the proportion of students who were sick at least one time during the semester with the level of precision (5%) and 95 percent confidence level (i.e., the corresponding z=1.96) assuming the estimated proportion is 0.5. Round up your answer to the integer. 3- Based on the dendrogram, what is the number of clusters that works better? a. 3 b. 4 c. 5 d. 6 4- Based on the box plot, which of the following statement is true? a) The central line does not exist for the female group because there is no sufficient data to determine it b) The test distribution of the male is positively skewed c) The mean test score for the male is 2 d. The IQR for the female is 5 5- We want to remove outliers before performing one-sample t-test, which of the following are valid procedures? a.Box plot, z-score, and scatter chart b.Box plot and z-score c.Standardized variable and scatter plot d.Box plot 6- To check if a linear correlation exists between two scale variables, we can run the Pearson correlation directly. We do not have to draw the scatter chart if we do not need visualization in the report. True Or False 7- The dataset has 20,000 observations, Hence, we use the following to test normality a.Kolmogorov-Smirnov Test b.Shapiro-Wilk Test c.Q-Q Plot d.All of the above are suitable 8- Which of the following statement about Spearman’s Rank Order Correlation Test is not correct? a.If Pearson correlation shows a positive correlation, Spearman’s test will show a positive correlation too b.It can also be applied to the ordinal categorical variable but not the nominal variable c.It can be used when the two variables to be tested are not normally distributed. However, they still need to have a linear relationship d.All of the above is incorrect 9- We can use Q-Q plot to compare our data distribution against a Poisson distribution. True Or False 10- When measuring the similarity between clusters, which one should be chosen if we want to measure the shortest distance between clusters a.Single linkage b.Complete linkage c.Group Average linkage d.Centroid linkage 11- When clustering observations with only binary variables, we can calculate the similarity based on which of the follows to reduce the effect of the absence of feature. a.Squared Euclidean Distance b.Euclidean Distance c.Jaccard's Coefficient d.Matching Coefficient 12- a. b. c. d. 13- a. b. c. d. 14- When we run the one-sample t-test to check if the population mean of income is equal to 50000. Suppose that the 95% C.I. of difference provided on the output table is [500, 900], which of the follows is likely to be the correct p-value? 0.035 0.065 0.535 All are possible We want to find out if the average commute times are different in the residents' zip codes (43615, 48103, and 48105), which of the following is more suitable given that the commute time is normally distributed within each zip code Paired-samples t-test One-way ANOVA Two-way ANOVA Independent-sample t-test The output for the paired t-test is provided below. Our H1 is that X7 and X4 are different. Based on the output, which of the following is the correct conclusion a. There is no significant evidence to show that X7 is different from X4 (p=0.076) b. We conclude that X7 is significantly different from X4 (p=0.038) c. Neither is correct 15- Based on the result below, we should report the t-test results from a. The first row b. c. The second row Either is ok 16- Download the file: ex1.sav . Run the KolmogorovSmirnov Normality Test on X1 for each gender, what is the significant level for males? a. 0.097 b. 0.179 c. 0.035 d. 0.024 17- Download the file:ex2.sav. Run a one-sample t-test to see if the mean for X3 is different from 140. Report the t statistic. a. 0.202 b. 0.830 c. 0.513 d. 0.246 18- Download the file: ex3.sav . Aggregate the data by Gender. What is the standard deviation of X2 for males? (enter two decimals point e.g., 35.24) 19- Download the file: ex1.sav Report the Spearman's correlation between X1 and X3 (report the exact value. Do not round.) 20- We plan to run Mann-Whitney Test to compare the savings for males and females. Check the assumptions for the attached data. Based on your finding, the Mann-Whitney Test will compare: Data:PP.sav a. mean b. mode c. variance d. median 21- We want to run a nonparametric test to compare the saving distributions for residents in Perrysburg and Ottawa Hills. Select the appropriate test taught in class and report the p-value. ind_t.sav 22- Calculate the mean for X1 (format:0.00) after removing or filtering out all observations that are two times away from the standard deviation. mean.sav 23- Run the K-mean cluster to divide observations into 2 groups based on age and height. What is the average age for the group with C in it? (format: x.xx) cluster.sav 24- Answer the following question based on your K-mean cluster analysis in the previous question. Which of the following is true? (choose any method you like in SPSS) a. The cluster that includes C is considered older and taller. b. The cluster that includes C is considered older and shorter. c. The cluster that includes C is considered younger and taller. d. The cluster that includes C is considered younger and shorter. 25- Pearson correlation can also help us determine how fast one variable can be affected by the other one.True or False
Purchase answer to see full attachment
User generated content is uploaded by users for the purposes of learning and should be used following Studypool's honor code & terms of service.

Explanation & Answer

Attached. Please let me know if you have any questions or need revisions.

1. The answer is A. A cross-tab shows the relationship between two or
more variables. Cross tabulations enables you to examine
relationships within data that might not be readily apparent when
analyzing total survey responses.
2. ( (1.96)2 *0.5(0.5))/(0.05)2
(3.8416*0.25)/0.0025
0.9604/0.0025
384.16
Rounding off gives a...

Related Tags