# Biostatistics

With this database, you will:

1. Identify and list the categorical and continuous variables.
2. Calculate frequencies and percentages for all of the categorical variables, and means and standard deviations for the continuous variables.
3. Run a one-sample z test of proportions for one categorical variable. Here is an example of the wording: "Is the proportion of English speaking patients greater than 50%?" Using the .05 level of significance, test the appropriate hypotheses.
   1. Write the null and alternate hypotheses.
   2. What is the proportion of English speaking patients?
   3. What are the standard error and z statistic?
   4. Find the critical z and p value. Is the p value significant? What is the conclusion?
4. Run an independent samples t-test to compare two groups of people (specified by a dichotomous variable) on one continuous variable. Using the .05 level of significance, test the appropriate hypotheses. To run this test, you have to rearrange your data.
   1. Write the null and alternate hypotheses.
   2. What is the calculated mean for each group?
   3. What is the confidence interval for each group?
   4. What is the critical t value?
   5. Is the p value significant? What is the conclusion?
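The rearrangement that item 4 asks for can be sketched in Python as follows. This is only an illustration under stated assumptions: the assignment does not say which variables to use, so the choice of `Language` as the grouping variable and `NumRegMed` as the continuous outcome is hypothetical, as is the use of the pooled (equal-variance) form of the test. The two lists are transcribed from the data listing in this document, in participant order.

```python
from math import sqrt
from statistics import mean, stdev

# Language (1 = English, 2 = Spanish), transcribed in participant order.
language = [
    1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 1, 2, 1, 1, 1, 2, 1, 1, 1, 1, 2, 1, 2,
    2, 2, 2, 2, 1, 1, 2, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 1, 1, 2, 1, 1, 2,
    2, 1, 1, 1, 1, 1, 2, 2, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2,
]
# NumRegMed (number of regular medications), same participant order.
num_reg_med = [
    1, 0, 3, 5, 0, 3, 1, 10, 7, 4, 3, 2, 0, 0, 0, 4, 0, 6, 0, 2, 8, 1, 7,
    4, 1, 3, 0, 0, 0, 1, 0, 5, 1, 0, 5, 3, 4, 18, 9, 3, 4, 0, 5, 6, 0, 1,
    0, 0, 5, 1, 0, 0, 14, 4, 5, 4, 6, 6, 9, 3, 5, 7, 3, 0, 0, 4,
]

# "Rearranging" the data: split the outcome into one list per group.
groups = {1: [], 2: []}
for lang, meds in zip(language, num_reg_med):
    groups[lang].append(meds)
english, spanish = groups[1], groups[2]

n1, n2 = len(english), len(spanish)
m1, m2 = mean(english), mean(spanish)
s1, s2 = stdev(english), stdev(spanish)  # sample standard deviations

# Pooled-variance t statistic (assumes equal variances in both groups).
df = n1 + n2 - 2
pooled_var = ((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / df
se_diff = sqrt(pooled_var * (1 / n1 + 1 / n2))
t_stat = (m1 - m2) / se_diff

print(f"English: n = {n1}, mean = {m1:.3f}")
print(f"Spanish: n = {n2}, mean = {m2:.3f}")
print(f"t = {t_stat:.3f} on {df} degrees of freedom")
```

The sketch stops at the t statistic because the critical t value, the p value, and the per-group confidence intervals all need quantiles of the t distribution, which the Python standard library does not provide. A t table or a statistics package would supply them.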

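| ID | Age | Language | HAQ4c | NumRegMed | HAQ15a | SmokHt |
|---|---|---|---|---|---|---|
| AA-31 | 74.00 | 1.00 | 0 | 1.00 | 1 | 1 |
| AD-29 | 73.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| AF-02 | 77.00 | 1.00 | 0 | 3.00 | 2 | 2 |
| AG-25 | 72.00 | 1.00 | 0 | 5.00 | 1 | 1 |
| AG-45 | 66.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| AM-01 | 76.00 | 1.00 | 0 | 3.00 | 1 | 1 |
| BS-52 | 72.00 | 1.00 | 0 | 1.00 | 1 | 1 |
| CC-63 | 77.00 | 2.00 | 0 | 10.00 | 2 | 2 |
| CK-42 | 75.00 | 2.00 | 1 | 7.00 | 3 | 1 |
| CN-18 | 79.00 | 2.00 | 1 | 4.00 | 1 | 1 |
| CR-03 | 71.00 | 1.00 | 0 | 3.00 | 2 | 2 |
| DC-09 | 71.00 | 2.00 | 0 | 2.00 | 2 | 2 |
| DH-33 | 66.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| EG-24 | 67.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| EG-46 | 79.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| EK-34 | 66.00 | 2.00 | 0 | 4.00 | 1 | 1 |
| EM-51 | 71.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| FA-50 | 78.00 | 1.00 | 1 | 6.00 | 1 | 1 |
| FP-04 | 67.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| GC-05 | 67.00 | 1.00 | 1 | 2.00 | 1 | 1 |
| GK-11 | 84.00 | 2.00 | 0 | 8.00 | 3 | 2 |
| GV-23 | 88.00 | 1.00 | 0 | 1.00 | 3 | 2 |
| HC-10 | 79.00 | 2.00 | 0 | 7.00 | 1 | 1 |
| HK-12 | 86.00 | 2.00 | 0 | 4.00 | 1 | 1 |
| HL-17 | 81.00 | 2.00 | 0 | 1.00 | 3 | 2 |
| HL-55 | 72.00 | 2.00 | 0 | 3.00 | 1 | 1 |
| IH-54 | 77.00 | 2.00 | 1 | 0.00 | 1 | 1 |
| JB-47 | 82.00 | 1.00 | 0 | 0.00 | 3 | 2 |
| JC-22 | 86.00 | 1.00 | 0 | 0.00 | 3 | 2 |
| JC-39 | 73.00 | 2.00 | 0 | 1.00 | 2 | 2 |
| JJ-49 | 76.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| JK-13 | 81.00 | 2.00 | 0 | 5.00 | 3 | 2 |
| JK-36 | 75.00 | 2.00 | 0 | 1.00 | 1 | 1 |
| JMC-56 | 73.00 | 2.00 | 0 | 0.00 | 1 | 1 |
| KC-59 | 78.00 | 2.00 | 1 | 5.00 | 2 | 2 |
| KK-14 | 81.00 | 2.00 | 0 | 3.00 | 3 | 1 |
| KK-35 | 83.00 | 2.00 | 0 | 4.00 | 3 | 1 |
| KL-57 | 71.00 | 2.00 | 0 | 18.00 | 2 | 2 |
| KP-64 | 72.00 | 2.00 | 1 | 9.00 | 3 | 1 |
| KS-20 | 66.00 | 2.00 | 0 | 3.00 | 2 | 2 |
| LC-06 | 67.00 | 1.00 | 0 | 4.00 | 1 | 1 |
| MG-30 | 68.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| ML-41 | 72.00 | 2.00 | 0 | 5.00 | 1 | 1 |
| MP-48 | 70.00 | 1.00 | 0 | 6.00 | 1 | 1 |
| MR-32 | 63.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| MW-67 | 73.00 | 2.00 | 0 | 1.00 | 1 | 1 |
| OY-65 | 71.00 | 2.00 | 0 | 0.00 | 2 | 2 |
| RG-07 | 87.00 | 1.00 | 0 | 0.00 | 3 | 2 |
| RG-26 | 75.00 | 1.00 | 0 | 5.00 | 1 | 1 |
| RO-53 | 71.00 | 1.00 | 0 | 1.00 | 1 | 1 |
| RP-28 | 84.00 | 1.00 | 1 | 0.00 | 3 | 2 |
| RR-27 | 68.00 | 1.00 | 0 | 0.00 | 1 | 1 |
| SHC-58 | 74.00 | 2.00 | 0 | 14.00 | 1 | 1 |
| SIC-62 | 70.00 | 2.00 | 0 | 4.00 | 2 | 2 |
| SJ-08 | 63.00 | 1.00 | 0 | 5.00 | 2 | 2 |
| SJ-43 | 71.00 | 2.00 | 0 | 4.00 | 2 | 2 |
| SK-15 | 79.00 | 2.00 | 0 | 6.00 | 1 | 1 |
| SKC-61 | 71.00 | 2.00 | 0 | 6.00 | 2 | 2 |
| SWK-66 | 79.00 | 2.00 | 0 | 9.00 | 2 | 2 |
| TP-19 | 81.00 | 2.00 | 1 | 3.00 | 1 | 1 |
| WC-40 | 84.00 | 2.00 | 1 | 5.00 | 1 | 1 |
| YJK-37 | 75.00 | 2.00 | 0 | 7.00 | 1 | 1 |
| YK-16 | 89.00 | 2.00 | 0 | 3.00 | 3 | 2 |
| YK-60 | 79.00 | 2.00 | 1 | 0.00 | 2 | 2 |
| YL-68 | 69.00 | 2.00 | 0 | 0.00 | 1 | 1 |
| YYK-38 | 71.00 | 2.00 | 0 | 4.00 | 1 | 1 |

| Name | Description | Answers |
|---|---|---|
| ID | Participant ID number | |
| Age | Age in years | Actual age |
| Language | Preferred language | 1 = English, 2 = Spanish |
| HAQ4c | Fractured hip, leg, arm, or spine | 0 = No, 1 = Yes |
| NumRegMed | Number of medications used regularly | Total number of meds |
| HAQ15a | Ability to take care of own personal needs | 1 = can do, 2 = can do with some difficulty, 3 = can not do without help |
| SmokHt | Past smoking history | 1 = nonsmoker, 2 = smoker |
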
1. Identify and list categorical and continuous variables.

| Variable name | Description | Variable type |
|---|---|---|
| Age | Age in years | Continuous |
| Language | Preferred language | Categorical |
| HAQ4c | Fractured hip, leg, arm, or spine | Categorical |
| NumRegMed | Number of medications used regularly | Continuous |
| HAQ15a | Ability to take care of own personal needs | Categorical |
| SmokHt | Past smoking history | Categorical |

2. Calculate frequencies and percentages for all of the categorical variables, and means and standard deviations for the continuous variables.

| Variable | Mean | Standard deviation |
|---|---|---|
| NumRegMed | 3.27 | 3.554176 |
| Age | 74.73 | 6.364675 |

| Language | Frequency | Percentage |
|---|---|---|
| English | 29 | 43.94% |
| Spanish | 37 | 56.06% |

| HAQ4c | Frequency | Percentage |
|---|---|---|
| No | 55 | 83.33% |
| Yes | 11 | 16.67% |

| HAQ15a | Frequency | Percentage |
|---|---|---|
| Can do | 38 | 57.58% |
| Can do with some difficulty | 15 | 22.73% |
| Can not do without help | 13 | 19.70% |

| SmokHt | Frequency | Percentage |
|---|---|---|
| Nonsmoker | 42 | 63.64% |
| Smoker | 24 | 36.36% |
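The summary figures for question 2 can be reproduced with a short Python sketch. It assumes the sample (n − 1) standard deviation, which is what the reported values correspond to. The `age` and `num_reg_med` lists are transcribed from the data listing in this document, and the frequency counts are the ones reported in the answer. The helper name `percentages` is made up for this illustration.

```python
from statistics import mean, stdev

# Age in years, transcribed in participant order.
age = [
    74, 73, 77, 72, 66, 76, 72, 77, 75, 79, 71, 71, 66, 67, 79, 66, 71, 78,
    67, 67, 84, 88, 79, 86, 81, 72, 77, 82, 86, 73, 76, 81, 75, 73, 78, 81,
    83, 71, 72, 66, 67, 68, 72, 70, 63, 73, 71, 87, 75, 71, 84, 68, 74, 70,
    63, 71, 79, 71, 79, 81, 84, 75, 89, 79, 69, 71,
]
# Number of regular medications, same participant order.
num_reg_med = [
    1, 0, 3, 5, 0, 3, 1, 10, 7, 4, 3, 2, 0, 0, 0, 4, 0, 6, 0, 2, 8, 1, 7,
    4, 1, 3, 0, 0, 0, 1, 0, 5, 1, 0, 5, 3, 4, 18, 9, 3, 4, 0, 5, 6, 0, 1,
    0, 0, 5, 1, 0, 0, 14, 4, 5, 4, 6, 6, 9, 3, 5, 7, 3, 0, 0, 4,
]
# Frequencies of each category, as reported in the answer.
counts = {
    "Language": {"English": 29, "Spanish": 37},
    "HAQ4c": {"No": 55, "Yes": 11},
    "HAQ15a": {"Can do": 38, "Some difficulty": 15, "Needs help": 13},
    "SmokHt": {"Nonsmoker": 42, "Smoker": 24},
}


def percentages(freq):
    """Convert a {label: count} mapping to percentages rounded to 2 places."""
    total = sum(freq.values())
    return {label: round(100 * n / total, 2) for label, n in freq.items()}


for name, values in (("Age", age), ("NumRegMed", num_reg_med)):
    # stdev() is the sample standard deviation (divides by n - 1).
    print(f"{name}: mean = {mean(values):.2f}, SD = {stdev(values):.6f}")

for name, freq in counts.items():
    print(name, percentages(freq))
```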

3. Run a one-sample z test of proportions for one categorical variable. Here is an example of the wording: "Is the proportion of English speaking patients greater than 50%?" Using the .05 level of significance, test the appropriate hypotheses.
1. Write the null and alternate hypotheses
H0: The proportion of English speaking patients is equal to 50%.
H0: p = 50%
Ha: The proportion of English speaking patients is greater than 50%.
Ha: p > 50%
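2. What is the proportion of English speaking patients?

$$\hat{p} = \frac{29}{66} = 0.4394$$

3. What are the standard error and z statistic?

$$SE = \sqrt{\frac{\hat{p}(1-\hat{p})}{n}} = \sqrt{\frac{0.4394 \times 0.5606}{66}} = 0.0611$$

$$z = \frac{\hat{p} - p_0}{SE} = \frac{0.4394 - 0.5}{0.0611} = -0.99$$

Here $\hat{p}$ is the sample proportion and $p_0 = 0.5$ is the proportion stated in the null hypothesis.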

4. Find critical z and p value. Is the p value significant? Conclusion?
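Critical z = 1.645 (one-tailed test, α = 0.05)
p value = P(Z > -0.99) = 0.8389
The p value is not significant because it is greater than the significance level (0.05). The z statistic (-0.99) is also below the critical value (1.645). We
thus fail to reject the null hypothesis and conclude that there is insufficient evidence t...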
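
The z test above can be checked with a minimal Python sketch using only the standard library. The variable names are illustrative. The worked answer computes the standard error from the sample proportion. Many textbooks instead compute it from the null value of 0.5, so the sketch computes both. That alternative is an addition here, not part of the original answer.

```python
from math import sqrt
from statistics import NormalDist

n = 66        # sample size
x = 29        # number of English speaking patients
p0 = 0.50     # proportion under the null hypothesis
alpha = 0.05  # significance level

p_hat = x / n

# Standard error as in the worked answer (uses the sample proportion).
se_sample = sqrt(p_hat * (1 - p_hat) / n)
z = (p_hat - p0) / se_sample

# Alternative convention: standard error under the null hypothesis.
se_null = sqrt(p0 * (1 - p0) / n)
z_null = (p_hat - p0) / se_null

std_normal = NormalDist()  # standard normal: mean 0, SD 1
z_crit = std_normal.inv_cdf(1 - alpha)  # one-tailed (upper) critical value
p_value = 1 - std_normal.cdf(z)         # P(Z > z) for Ha: p > p0

print(f"p_hat = {p_hat:.4f}")
print(f"SE (sample) = {se_sample:.4f}, z = {z:.2f}")
print(f"SE (null)   = {se_null:.4f}, z = {z_null:.2f}")
print(f"critical z = {z_crit:.3f}, p value = {p_value:.4f}")
print("reject H0" if p_value < alpha else "fail to reject H0")
```

Because the sketch uses the unrounded z statistic, its p value differs slightly in the third decimal place from the 0.8389 obtained with z rounded to -0.99. The decision is the same either way.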

