STAT 250 Data Analysis Assignment 2

Description

STAT 250 Spring 2019 Data Analysis Assignment 2

Your submitted document should include the following items. Points will be deducted if the following are not included.

  1. Type your Name and STAT 250 with your correct section number (e.g. STAT 250-xxx) right justified and then Data Analysis Assignment #2 centered on the top of page 1 below your name, then begin your document.
  2. Number your pages across your entire solutions document.
  3. Your document should include the ANSWERS ONLY with each answer labeled by its corresponding number and subpart. Keep the answers in order. Do not include the questions in your submitted document.
  4. Generate all requested graphs and tables using StatCrunch.
  5. Upload your document onto Blackboard as a Word (docx) file or pdf file using the link provided by your instructor. You are responsible for uploading a readable file.

The full assignment instructions, as well as an example, are attached as a Word file.

Access to StatCrunch is required.

https://www.statcrunch.com/5.0/group.php?groupid=7...

I will provide the login info...


Extra Notes:

- Each graph title should start with "Distribution of.."

- For the questions that require calculation, you may work on paper, but you must type the solution into the Word document.

Unformatted Attachment Preview

STAT 250 Spring 2019 Data Analysis Assignment 2

Your submitted document should include the following items. Points will be deducted if the following are not included.

  1. Type your Name and STAT 250 with your correct section number (e.g. STAT 250-xxx) right justified and then Data Analysis Assignment #2 centered on the top of page 1 below your name, then begin your document.
  2. Number your pages across your entire solutions document.
  3. Your document should include the ANSWERS ONLY with each answer labeled by its corresponding number and subpart. Keep the answers in order. Do not include the questions in your submitted document.
  4. Generate all requested graphs and tables using StatCrunch.
  5. Upload your document onto Blackboard as a Word (docx) file or pdf file using the link provided by your instructor. It is your responsibility for uploading a readable file.
  6. You may not work with other individuals on this assignment. It is an honor code violation if you do.

Elements of good technical writing:

- Use complete and coherent sentences to answer the questions.
- Graphs must be appropriately titled and should refer to the context of the question.
- Graphical displays must include labels with units if appropriate for each axis.
- Units should always be included when referring to numerical values.
- When making a comparison you must use comparative language, such as “greater than”, “less than”, or “about the same as.”
- Ensure that all graphs and tables appear on one page and are not split across two pages.
- Type all mathematical calculations when directed to compute an answer ‘by-hand.’ Pictures of actual handwritten work are not accepted on this assignment.
- When writing mathematical expressions into your document you may use either an equation editor or common shortcuts such as: √x can be written as sqrt(x), p̂ can be written as p-hat, x̄ can be written as x-bar.
Problem 1: Game Spinner

We will be comparing empirical (relative frequencies based on an observation of a real-life process) to theoretical (long-run relative frequency) probabilities. We will use StatCrunch to simulate this process using a board game spinner three times so that we can determine the total number of spaces moved in three turns. The board game spinner looks like the image below. The spinner is equally likely to land on any given section.

a) Build a probability distribution table for the result of a single spin. Present this completed table in your document.

b) Simulate using the spinner 3 times by following the steps below:
  Step 1: Open up the data set “Spinner Options”. This contains the value on the 8 spinner locations, each option of which is equally likely to occur.
  Step 2: Click on Applets → Spinner (the second blue box at the top).
  Step 3: For the “Labels” option, select Spaces. For the “Weights” option, select Weights.
  Step 4: Select Compute!
  Step 5: A new window labeled “Spinner experiment” should open. Click the Spin button and StatCrunch will simulate using the spinner once. Click this button 2 more times so that you have 3 results.
  Step 6: Click the Analyze button. The data from your 3 spins should now be stored as a column in StatCrunch.
  Step 7: Resize the Spinner experiment window so that it contains 1 row of 3 spins. Copy this image into your document for your answer to part (b).

c) In the Spinner experiment window, click the Reset button. Simulate another 3 spins and store the results in StatCrunch by clicking the Analyze button. Copy an image of the window into your document for your answer to part (c). Repeat this process three more times to produce a total of four sets of three spins.

d) To simulate 3 spins 100 times and find the total number of spaces moved for each, use the following steps:
  Step 1: Under Data → Simulate → select Custom
  Step 2: Under “Values in:”, select Spaces. Under “Weights in:”, select Weights.
  Step 3: Under “Number of rows and columns:”, enter 3 for Rows and 95 for Columns.
  Step 4: Select Compute! You should now have 100 columns (including the 5 from use of the applet) with information for 3 spins.
  Step 5: From here go to Stat → Summary Stats → Columns
  Step 6: Select all columns except for the Spaces and Weights columns (to do this, click on your first “Spins” in the select column(s) box, hold the Shift key, scroll down and select “Custom95”). You should see 100 columns selected in the white box.
  Step 7: Under “Statistics:”, select only Sum.
  Step 8: Under “Output:”, check the box for Store in data table.
  Step 9: Click Compute!

Make a properly titled and labeled relative frequency histogram out of the resulting Sum column. Copy the image of the histogram into your document for your answer to part (d).

e) Use your results in part (d) to find the empirical probability of moving 10 or more spaces in 3 spins. Show the calculation for this empirical probability and state your probability as a decimal rounded to three decimal places.

f) Calculate the theoretical probability of moving 10 or more spaces in 3 spins (i.e. obtaining the sum of the spins to be 10 or greater). Use your probability distribution in part (a) and note that spins are independent. (Hint: Recognize that getting a 1 on spin 1, a 2 on spin 2, and a 1 on spin 3 is a different result than getting a 1 on spin 1, a 1 on spin 2, and a 2 on spin 3.) Show how you obtained this probability and provide the answer.

g) In a sentence, compare your empirical probability from part (e) to your theoretical probability in part (f).

h) How would you expect the empirical probability in part (e) to change if it had been based on a simulation of 1000 repetitions and why? Answer this question in one to two sentences.

Problem 2: Main Street Speed Limit

A portion of Main Street (Route 236) in Fairfax, VA has a posted speed limit of 35 miles per hour.
Fairfax police collected data on the actual speeds of a sample of 338 vehicles driving on this portion of Main Street between 2:30 and 3:30 p.m. The data set “Main Street Speed Data” contains this sample of vehicle speeds (in MPH) collected over the past six months.

a) Use StatCrunch to construct an appropriately titled and labeled relative frequency histogram of the vehicle speeds stored in the “Speed” variable. Copy your histogram into your document.

b) What is the shape of this distribution? Answer this question in one complete sentence.

c) Now overlay your histogram with a Normal curve and add a vertical line at the mean. This can be done by going to Options → Edit in the top left corner of your graph. Inside the histogram graph box, look for Display Options. Next to “Overlay distrib.:” click the arrow next to the word --optional-- and select Normal. Then, check the box next to mean under the word “Markers.” Copy and paste this histogram into your document.

d) Do you think it is reasonable to use the normal model in this case? Answer this question in one complete sentence.

e) Calculate the sample size, the mean, and the standard deviation of the “Speed” variable using StatCrunch. (Select Stat → Summary Stats → Columns.) Copy and paste this table into your document. Round the mean and standard deviation to two decimal places inside this table.

For parts (f) – (h), assume that the distribution of all vehicle speeds in the population is Normal with the mean and standard deviation found in Part (e) (again use the rounded mean and standard deviation values). Note: you are using the Normal distribution for the next three calculations.

f) Calculate the probability that a randomly selected vehicle is driving above the posted speed limit of 35 miles per hour. First, draw a picture with the mean labeled, shade the area representing the desired probability, standardize, and use the Standard Normal Table (Table 2 in your text) to obtain this probability.
Please take a picture of your hand drawn sketch and upload it to your Word document (if you do not have this technology, you may use any other method (e.g. Microsoft Paint) to sketch the image). You must type the rest of your “by hand” work to earn full credit.

g) Verify your answer in part (f) using the StatCrunch Normal calculator (see instructions below) and copy that image into your document. In addition, write one sentence to explain what the probability means in context of the question.

h) Use StatCrunch only to calculate the probability that a randomly selected vehicle was driving between 33 and 37 miles per hour. Copy the Normal distribution image from StatCrunch into your document. Then, once you obtain your answer, write one sentence to explain what the probability means in context of the question.

i) Suppose the police department decided that the top 20% of speeds would automatically receive a speeding ticket. Determine the minimum speed for which a driver would receive a speeding ticket. This speed or any speed above it will receive a ticket. Draw a picture (or two), shade area, and use Table 2 to solve this problem. Please take a picture of your hand drawn sketch and upload it to your Word document (if you do not have this technology, you may use any other method (e.g. Microsoft Paint) to sketch the image). You must type the rest of your “by hand” work to earn full credit.

j) Verify your answer in part (i) using the StatCrunch Normal calculator (see instructions below) and copy that image into your document. In addition, write one sentence to explain what the probability means in context of the question.

Steps to produce StatCrunch Normal graphs.

  Step 1: Open the calculator by selecting Stat → Calculators → Normal as shown below.
- Standard – shows area above or below a specified x value.
- Between – shows area between two specified x values.
- Enter the value of the mean and standard deviation.
- Select to change the direction of the inequality sign to match the question.
- Enter either a value in the first box to find a probability OR a probability in the last box to find a value.
  Step 2: Enter the values for the mean and standard deviation found in part 2e into their respective boxes.

Problem 3: Celiac Disease

Celiac disease is an autoimmune disorder where the ingestion of gluten leads to damage in the small intestine. Left untreated, celiac disease can lead to the development of other autoimmune disorders like Type I diabetes, multiple sclerosis, anemia, and osteoporosis. Generally, the later in life that celiac disease is diagnosed, the higher the chances of developing another autoimmune condition. In fact, it is known that 34% of individuals with celiac disease that is first diagnosed when they are 21 years of age or older will develop another autoimmune condition. Suppose we are interested in the number of individuals that develop another autoimmune disorder in a random sample of 9 people with celiac disease first diagnosed after they turn 21. Assume these people are independent of each other.

a) Check if this situation fits the binomial setting. Write four complete sentences addressing each requirement in one sentence each.

b) Assuming this situation is a binomial experiment, build the probability distribution in table form in StatCrunch. There are two ways to do this. You may use Data → Compute → Expression and choose the function dbinom. This method relies on you entering the values of the random variable in the first column of your data table. The other way to do this is to use the binomial calculator and calculate the probability of each of the values of the random variable from X = 0 to X = 9. You may present this table horizontally or vertically and leave the probabilities unrounded.

c) Calculate the probability that exactly three people in the sample develop another autoimmune disorder using the StatCrunch binomial calculator.
Copy this image from StatCrunch into your document. Then, once you obtain your answer, write one sentence to explain what the probability means in context of the question.

d) Calculate the probability that no more than 6 people in the sample develop another autoimmune disorder using the StatCrunch binomial calculator. Again, provide a StatCrunch binomial calculator graph to display your answer. Then, once you obtain your answer, write one sentence to explain what the probability means in context of the question.

e) Calculate the probability that between 2 and 5 people in the sample (inclusive) develop another autoimmune disorder. Show your work using the probability distribution you built in part (b) to answer this question. Then, verify it with a StatCrunch binomial calculator graph and include this image in your document as well. Finally, once you obtain your answer, write one sentence to explain what the probability means in context of the question.

f) Calculate the mean and standard deviation of this probability distribution. Show your work using the binomial mean and standard deviation formulas and provide your answers in your document. (No need to use StatCrunch for this part).

Problem 4: Building a Sampling Distribution

We will use the Sampling Distribution applet in StatCrunch to investigate properties of the sampling distribution of the proportion of students that find themselves distracted by their cell phone during class. Historically, it is known that 72% of students get distracted by their cell phone. Under Applets, open the Sampling distribution applet (box shown below). First, select Binary for the population, then enter the value for p = 0.72, the proportion of students who are distracted next to “p:” Then click on Compute. See image below.

a) Once the applet box is opened, enter 10 in the box to the right of the words “sample size” in the right middle of the applet box window (see image below).
Then, at the top of the applet, click “1 time.” Watch the resulting animation. When the sample is completed, copy and paste the entire applet box (using options → copy) into your document.

b) Click Reset at the top of the applet. Then, click the “1000 times” to take 1000 samples of size 10. Copy and paste the applet image into your document.

c) Describe the shape of the Sample Proportions graph at the bottom of your image from part (b) in one sentence.

d) Why do you think that this graph does not have an approximately Normal shape? Use the Central Limit Theorem large sample size condition to answer this question in one sentence. Explicitly show these calculations.

e) Click Reset at the top of the applet. Type 100 in the sample size box. Then, click the “1000 times” to take 1000 samples of size 100. Copy and paste the applet image into your document.

f) Describe the shape of the Sample Proportions graph at the bottom of your image from part (e) in one sentence.

g) Why do you think that this graph from part (f) has the shape you described? Use the Central Limit Theorem large sample size condition to answer this question in one sentence. Explicitly show these calculations.

h) Using the image in part (e), write the values you obtained for the mean (in green) and the standard deviation (in blue). These values are found in the bottom right box labeled “Sample Prop. of 1s.”

i) Compare the mean value (in green, found in part (h)) to the known population proportion in one sentence.

j) Now calculate the standard error of the sample proportion using p = 0.72 and n = 100 by hand. Show this calculation “by-hand” and round your answer to three decimal places. Type your “by-hand” work.

k) Compare the value in part (j) to the standard deviation (in blue) you obtained in part (h) in one sentence.
l) Finally, use the sampling distribution defined by the Central Limit Theorem to calculate the probability that from a sample of 100 students at least 80% are distracted by their cell phones (using p = 0.72 and the standard error found in part (j)). Show your work by using the formula to calculate the z-value and using the standard Normal probability table to obtain your answer. Type your “by-hand” work.

m) Interpret the resulting probability from part (l) in context.

Sample Solution to Display Formatting

Problem X: Students’ Grades

A random sample of 30 students was selected from a STAT 250 course taught during the summer session and their first exam scores were recorded.

a) Create a histogram in StatCrunch. Be sure to title and label it correctly.

b) Interpret the histogram’s shape.

See sample solution and formatting on page 2.

Notes about submission

Following the main points will help you submit a professionally completed assignment.

  1) Right justify your name and provide your correct section and the due date.
  2) Center the specific homework assignment title.
  3) Bold each complete problem number.
  4) The graph can be around the below size for readability (click on the graph once and only adjust the size of the graph by using the bottom right dot).
  5) Remember not to include the questions in your answer. Only provide answers. Please keep the assignment in problem and part order (present 1a, then 1b, and so on).

Kenneth Strazzeri
STAT 250-0xx (your correct section)

Data Analysis Assignment 1

Problem X
a)
b) The shape of this distribution is left skewed because I see the majority of the data values falling in the upper end of the distribution and a few 50s and 60s skewing the shape. There do not seem to be any outliers visible on the graph.
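
For Problem 3 of the assignment, the binomial table, mean, and standard deviation can be cross-checked outside StatCrunch. The following is a minimal Python sketch, assuming the binomial model with n = 9 and p = 0.34 stated in the problem; the names `N_PEOPLE`, `P_DEVELOP`, and `binomial_pmf` are illustrative and do not come from the assignment or from StatCrunch.

```python
from math import comb, sqrt

# Values taken from the Problem 3 statement.
N_PEOPLE = 9       # sample size
P_DEVELOP = 0.34   # probability of developing another autoimmune condition

def binomial_pmf(k, n, p):
    """P(X = k) for a binomial random variable with parameters n and p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Probability distribution for X = 0, 1, ..., 9 (part b).
pmf_table = {k: binomial_pmf(k, N_PEOPLE, P_DEVELOP) for k in range(N_PEOPLE + 1)}

# Binomial mean and standard deviation formulas (part f).
mean = N_PEOPLE * P_DEVELOP
std_dev = sqrt(N_PEOPLE * P_DEVELOP * (1 - P_DEVELOP))

# P(2 <= X <= 5), inclusive, summed from the table (part e).
prob_2_to_5 = sum(pmf_table[k] for k in range(2, 6))

for k, prob in pmf_table.items():
    print(f"P(X = {k}) = {prob}")
print(f"mean = {mean:.2f}, standard deviation = {std_dev:.4f}")
print(f"P(2 <= X <= 5) = {prob_2_to_5:.4f}")
```

The table values should agree with the StatCrunch binomial calculator up to rounding; the written interpretation sentences still have to be supplied separately.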

Explanation & Answer

Review and let me know if you need anything else.

Name
STAT 250-0xx (your correct section)

Data Analysis Assignment #2
Problem 1: Game Spinner
a) Let X be a random variable representing the number of spaces shown on a single spin.
X takes the values 1, 2, 3, or 4. The spinner has eight equally likely sections:
four sections labeled 1, two sections labeled 2, and one section each labeled
3 and 4.

The probability distribution table for the result of a single spin:

x       1      2      3       4       Total
P(x)    0.5    0.25   0.125   0.125   1
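
The table can be cross-checked with a short Python sketch. It assumes the section counts described in part (a) (four sections showing 1, two showing 2, and one each showing 3 and 4); the spinner image itself is not reproduced here, and the names `SECTION_COUNTS` and `single_spin_distribution` are illustrative only.

```python
from fractions import Fraction

# Assumed section counts, read from the description in part (a):
# four sections labeled 1, two labeled 2, one labeled 3, one labeled 4.
SECTION_COUNTS = {1: 4, 2: 2, 3: 1, 4: 1}

def single_spin_distribution(section_counts):
    """Return {value: probability} when every section is equally likely."""
    total_sections = sum(section_counts.values())
    return {value: Fraction(count, total_sections)
            for value, count in section_counts.items()}

distribution = single_spin_distribution(SECTION_COUNTS)

for value, prob in distribution.items():
    print(f"P(X = {value}) = {float(prob)}")
print("Total =", float(sum(distribution.values())))
```

Exact fractions are used so that the check that the probabilities sum to 1 does not depend on floating-point rounding.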

b) Simulating using the spinner 3 times

c) Simulating another 3 spins

d)

e) The total number of trials is 100. The relative frequency of sums from 10 to 12
(event E) is the combined height of the histogram bins [10, 11) and [11, 12], which is 0.04.
The frequency of E is therefore 100 × 0.04 = 4, and the empirical probability of moving
10 or more spaces in 3 spins is

\[
P(E) = \frac{\text{frequency of } E}{\text{total number of trials in the experiment}} = \frac{4}{100} = 0.040
\]
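
The arithmetic in part (e) can be sketched in Python as follows. This is an illustration only: the assignment requires StatCrunch for the actual simulation, the spinner weights are assumed from the distribution in part (a), and `empirical_probability` and `simulate_totals` are hypothetical helper names.

```python
import random

def empirical_probability(sums, threshold=10):
    """Fraction of simulated three-spin totals that are at least `threshold`."""
    hits = sum(1 for total in sums if total >= threshold)
    return hits / len(sums)

def simulate_totals(repetitions=100, spins=3, seed=None):
    """Simulate `repetitions` totals of `spins` spins (weights assumed from part (a))."""
    rng = random.Random(seed)
    values = [1, 2, 3, 4]
    weights = [4, 2, 1, 1]  # number of spinner sections showing each value
    return [sum(rng.choices(values, weights=weights, k=spins))
            for _ in range(repetitions)]

# Reproduce the calculation above: 4 of 100 totals are 10 or more.
example_sums = [10, 10, 11, 12] + [5] * 96
print(f"{empirical_probability(example_sums):.3f}")  # 0.040

# A fresh simulation will generally give a different empirical probability.
totals = simulate_totals(seed=250)
print(f"{empirical_probability(totals):.3f}")
```

Raising `repetitions` to 1000 is one way to explore the question in part (h) about how the empirical probability behaves with more repetitions.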
f) The theoretical probability of moving 10 or more spaces in 3 spins is the probability of
getting a combination of (2,4,4) or (...
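
Because the answer preview is cut off at this point, an independent check may be useful. The sketch below is not the answer author's method: it brute-forces every ordered outcome of three independent spins, using the distribution from part (a) as an assumption, and adds up the probabilities of outcomes whose total is at least 10. The names `SPIN_DISTRIBUTION` and `probability_sum_at_least` are illustrative.

```python
from fractions import Fraction
from itertools import product

# Single-spin distribution from the table in part (a) (assumed correct).
SPIN_DISTRIBUTION = {
    1: Fraction(1, 2),
    2: Fraction(1, 4),
    3: Fraction(1, 8),
    4: Fraction(1, 8),
}

def probability_sum_at_least(distribution, spins, threshold):
    """Sum P(outcome) over all ordered outcomes whose total is >= threshold."""
    total_probability = Fraction(0)
    for outcome in product(distribution, repeat=spins):
        if sum(outcome) >= threshold:
            # Spins are independent, so multiply the single-spin probabilities.
            outcome_probability = Fraction(1)
            for value in outcome:
                outcome_probability *= distribution[value]
            total_probability += outcome_probability
    return total_probability

theoretical = probability_sum_at_least(SPIN_DISTRIBUTION, spins=3, threshold=10)
print(theoretical, f"{float(theoretical):.3f}")  # 13/512 0.025
```

Enumerating ordered outcomes treats (2,4,4), (4,2,4), and (4,4,2) as distinct results, which is the point of the hint in the assignment.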

