Math 533, course project part A to part C
Course Project: Hospital Stays, Impact of Education, Health Data
Introduction  PROJECT PART A: Exploratory Data Analysis  Project Part A: Grading Rubric  Project Part B: Hypothesis Testing and Confidence Intervals  Project Part B: Grading Rubric  Project Part C: Regression and Correlation Analysis  Project Part C: Grading Rubric
Introduction Back to Top
In this case, 40 individuals visited the hospital looking for information and education regarding their diabetes. Since diabetes relates to the body’s ability to use glucose, their glucose level was measured. These patience took a knowledge test about diabetes before and after being given some educational materials. Satisfaction with their overall visit was also measured through a survey. The data appear below for your reference. The Excel spreadsheet with these data can be found in Doc Sharing and should be opened in Minitab to complete the analyses required in each part of this project.
Hosp_Stay 
Hosp_Satisfaction 
Diab_Pretest 
Diab_Posttest 
Glucose 
2 
VeryDissat 
34 
39 
122 
3 
SWDissat 
33 
30 
116 
1 
SWSat 
29 
22 
108 
1 
SWSat 
17 
14 
63 
7 
SWDissat 
69 
60 
74 
7 
VerySat 
69 
70 
84 
8 
SWDissat 
81 
82 
67 
8 
VerySat 
81 
88 
57 
9 
SWSat 
52 
100 
231 
4 
SWDissat 
45 
48 
93 
1 
SWSat 
38 
28 
76 
1 
VerySat 
27 
33 
163 
6 
VeryDissat 
45 
54 
217 
8 
SWDissat 
58 
71 
112 
7 
SWDissat 
68 
84 
107 
6 
SWDissat 
48 
49 
95 
1 
VerySat 
9 
15 
104 
8 
VerySat 
50 
51 
63 
6 
SWDissat 
65 
51 
189 
1 of 6
9/24/2014 6:08 AM
MATH399 Course Home Course Project (NOV14) https://devry.equella.ecollege.com/file/cd2c455dffcf411cae8d2c5175...
6 
SWDissat 
64 
78 
53 
1 
VerySat 
25 
30 
96 
5 
VerySat 
50 
45 
102 
2 
VeryDissat 
47 
41 
142 
2 
VeryDissat 
24 
18 
133 
3 
VerySat 
35 
30 
88 
1 
SWDissat 
29 
31 
87 
4 
SWDissat 
39 
33 
161 
4 
SWDissat 
53 
42 
104 
5 
VeryDissat 
57 
49 
92 
4 
SWSat 
50 
50 
168 
8 
VeryDissat 
64 
100 
72 
6 
VeryDissat 
90 
95 
100 
8 
SWDissat 
52 
63 
63 
9 
SWSat 
52 
64 
132 
9 
VerySat 
67 
77 
171 
8 
SWSat 
68 
68 
163 
5 
SWSat 
67 
60 
84 
7 
SWSat 
44 
53 
102 
7 
VeryDissat 
57 
51 
130 
3 
SWSat 
58 
50 
71 
The variables are

Hospital stay – number of days in hospital

Hospital satisfaction – level of satisfaction with hospital services during stay (very highly satisfied, highly
satisfied, somewhat dissatisfied, and very dissatisfied)

Diabetes Pretest – score on diabetes knowledge test (must score 95 or higher to leave)

Diabetes Posttest – score on diabetes knowledge test after receiving educational material created for
this study

Glucose – blood glucose in mg/dL.
The data are available in Doc Sharing Course Project Data Set as an Excel file. You are to copy and paste the data set into a Minitab Worksheet.
PROJECT PART A: Exploratory Data Analysis Back to Top
Open the file MATH533_Course_Project_Data_HOSPITAL.xlsx from Doc Sharing.
Summarize the data for each of the five variables. For each variable, find the mean, median, variance,
and standard deviation. Use Minitab as appropriate, and explain what the results mean. These
calculations may not be possible for each variable. If you cannot calculate any or all of these for a
specific variable, please note that and state why.
2 of 6
9/24/2014 6:08 AM
MATH399 Course Home Course Project (NOV14) https://devry.equella.ecollege.com/file/cd2c455dffcf411cae8d2c5175...
Analyze the connections or relationships between two variables. There are ten pairings possible here (Hospital stay and hospital satisfaction, hospital stay and diabetes pretest, hospital stay and diabetes posttest, hospital stay and glucose, hospital satisfaction and diabetes pretest, hospital satisfaction and diabetes posttest, hospital satisfaction and glucose, diabetes pretest and diabetes posttest, diabetes pretest and glucose, and diabetes posttest and glucose ). Choose two of these pairings and find the correlation coefficient and show the scatter plot. Explain what you see. Some variables show clear relationships, while others do not.
Prepare your report in Microsoft Word (or some other word processing package), integrating your graphs and tables with text explanations and interpretations. Be sure that you have graphical and numerical back up for your explanations and interpretations. Be selective in what you include in the report. You should not generate a 20 page report on every variable and every possible relationship. Rather what you should do is to highlight what you see for the individual and two or three sentences of interpretation. For the two pairings you selected, identify and report your results using graphical and numerical summary (as appropriate), with interpretations.
Project Part B: Hypothesis Testing and Confidence Intervals Back to Top Your manager has speculated the following:

the average (mean) level of glucose is less than 130

the true population proportion of number somewhat satisfied with their hospital stay is greater than 28%,

the average (mean) score on the diabetes pretest is greater than 52.

the average (mean) score on the diabetes posttest ONLY among those that are "somewhat satisfied" is
less than 75.

Using the sample data, perform the hypothesis test for each of the above situations in order to
see if there is evidence to support your manager’s belief in each case a.d. In each case use the Seven Elements of a Test of Hypothesis, in Section 6.2 of your textbook with α = .05, and explain your conclusion in simple terms. Also be sure to compute the pvalue and interpret.

Follow this up with computing 95% confidence intervals for Glucose and Diabetes Posttest, and again interpreting these intervals.

Write a report to your manager about the results, distilling down the results in a way that would be understandable to someone who does not know statistics. Clear explanations and interpretations are critical.

All DeVry University policies are in effect, including the plagiarism policy.
Project Part B report is due by the end of Week 6.
Project Part B is worth 100 total points. See grading rubric below.
Submission: The report from part 3 + all of the relevant work done in the hypothesis testing (including Minitab) in 1, and the confidence intervals (Minitab) in 2 as an appendix.
Format for report:

Summary Report (about 1 paragraph on each of the speculations a.d.)

Appendix with all of the steps in hypothesis testing (the format of the Seven Elements of a Test of
Hypothesis, in Section 6.2 of your textbook) for each speculation a.d. as well as the confidence intervals, and including all Minitab output
Project Part C: Regression and Correlation Analysis Back to Top Using Minitab perform the regression and correlation analysis for the data on diabetes posttest (Y), the
dependent variable, and diabetes pretest (X), the independent variable, by answering the following.

Generate a scatterplot for diabetes posttest (Y) vs. diabetes pretest (X) including the graph of the "best fit" line. Interpret.

Determine the equation of the "best fit" line, which describes the relationship between diabetes posttest and diabetes pretest.

Determine the coefficient of correlation. Interpret.

Determine the coefficient of determination. Interpret.

Test the utility of this regression model (use a two tail test with α =.05). Interpret your results, including
the pvalue.

Based on your findings in 15, what is your opinion about using diabetes pretest to predict diabetes
postest? Explain.

Compute the 95% confidence interval for beta1 (the population slope). Interpret this interval. In an
attempt to improve the model, we attempt to do a multiple regression model predicting diabetes posttest
based on diabetes pretest and glucose.

Using Minitab, run the multiple regression analysis using the variables diabetes pretest and glucose to
predict diabetes posttest. State the equation for this multiple regression model.

Perform the Global Test for Utility (FTest). Explain your conclusion.

Perform the ttest on each independent variable. Explain your conclusions and clearly state how you should proceed. In particular, which independent variables should we keep and which should be discarded.

Is this multiple regression model better than the linear model that we generated in parts 17? Explain.
All DeVry University policies are in effect, including the plagiarism policy.
Project Part C report is due by the end of Week 7.
Project Part C is worth 100 total points. See grading rubric below.
Summarize your results from 111 in a report that is three pages or less in length and explains and interprets the results in ways that are understandable to someone who does not know statistics.
5 of 6 9/24/2014 6:08 AM
MATH399 Course Home Course Project (NOV14) https://devry.equella.ecollege.com/file/cd2c455dffcf411cae8d2c5175...
Submission: The summary report + all of the work done in 111 (Minitab Output + interpretations) as an appendix.
Format:
A. Summary Report
B. Points 111 addressed with appropriate output, graphs and interpretations. Be sure to number each
point 111.
Project Part C: Grading Rubric Back to Top
Category 
Points 
% 
Description 
Questions 1  9 and 11, are worth 6.5 pts. each 
65 
65 
addressed with appropriate output, graphs and interpretations 
Question 10 
15 
15 
addressed with appropriate output, graphs and interpretations 
Summary 
20 
20 
writing, grammar, clarity, logic, and cohesiveness 
Total 
100 
100 
A quality paper will meet or exceed all of the above requirements. 
Back to Top