Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Econ 105 Stats Review: Means, Variance, Interquartile Range, and Prob. Distributions (90 c, Exams of Statistics

A statistics review for economics 105 students at davidson college. It covers calculating means for two datasets, variance, interquartile range, and coefficient of variation. Additionally, it includes problem-solving on probability distributions, including expected value and variance for bernoulli random variables and chebyshev's inequality.

Typology: Exams

Pre 2010

Uploaded on 08/09/2009

koofers-user-l0m
koofers-user-l0m 🇺🇸

10 documents

1 / 7

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Name:
Statistics Davidson College
Economics 105, Jan-May 2004 Mark C. Foley
Review # 1
Suggested Solutions
Due in class Friday.
Directions: This review is closed-book, closed-notes (except for your formula sheet) to be
taken in one sitting not to exceed 3 hours. You may use a calculator. You may not use Excel.
Perform your calculations to 3 decimal places, where necessary.
There are 100 points on the exam. Each problem is worth 25 points.
You must show all your work to receive full credit. Any assumptions you make and
intermediate steps should be clearly indicated. Do not simply write down a final answer to the
problems without an explanation.
Please turn in your formula sheet with your exam.
Carpe diem.
Honor Pledge
Start time
End time
pf3
pf4
pf5

Partial preview of the text

Download Econ 105 Stats Review: Means, Variance, Interquartile Range, and Prob. Distributions (90 c and more Exams Statistics in PDF only on Docsity!

Name:

Statistics Davidson College

Economics 105, Jan-May 2004 Mark C. Foley

Review # 1

Suggested Solutions

Due in class Friday.

Directions : This review is closed-book, closed-notes (except for your formula sheet) to be

taken in one sitting not to exceed 3 hours. You may use a calculator. You may not use Excel.

Perform your calculations to 3 decimal places, where necessary.

There are 100 points on the exam. Each problem is worth 25 points.

You must show all your work to receive full credit. Any assumptions you make and

intermediate steps should be clearly indicated. Do not simply write down a final answer to the

problems without an explanation.

Please turn in your formula sheet with your exam.

Carpe diem.

Honor Pledge

Start time

End time

Problem 1

Consider the following two datasets on the number of mangoes eaten per day:

Sample K: 1 2 3 5 5 5 7 8 9

Population C: 1 2 3 5 5 5 7 8 9

(a) Calculate the mean of each dataset. If different, explain why.

For sample K,

1

n

x

x

n

i

i mangoes

For population C,

1

N

c

N

i

i

C

mangoes

(b) Calculate the variance of each dataset. If different, explain why.

For sample K,

2

1

2 2

1

2

2

 

 

n

x n x

n

x x

s

n

i

i

n

i

i

mangoes

2

[( 4 ) ( 3 ) ( 2 ) 0 0 0 2 3 4 )

2 2 2 2 2 2

1

2

2

N

x

N

i

i

mangoes

2

They are different because dividing by (n-1) instead of n, as we might expect, makes

2 2 E [ s ]  , which is a desirable property (called “unbiasedness”). In other words, if we

took a really large number of samples, calculated s

2 for each of them and plotted a

histogram of all the s

2 , the mean value (expected value) would be

2

. If the definition of

sample variance actually divided by n, call this “bad definition”

s , then^

2 2 ]

~ E [ s  .

(c) Calculate the interquartile range and coefficient of variation for Sample K. Indicate your

units. Briefly explain why the coefficient of variation is useful in comparing the variability of two

or more variables.

Q 3 =¾(n+1)

th observation = 30/4 = 7.

th observation which is 7 + .5(8-7) = 7.

Q 1 =¼(n+1)

th observation = 10/4 = 2.

th observation which is 2 + .5(3-2) = 2.

Interquartile range is 7.5 – 2.5 = 5 mangoes. This is the range for the middle 50% of the

data.

  • 100 53. 852 % 5
  1. 25    mangoes

mangoes

x

s CV

The coefficient of variation is useful in comparing variability of two variables because it

is not affected by the units in which those variables are measured. CV standardizes the

sample variance by the sample mean, so that the units cancel.

Problem 2

(a) Consider K Bernouilli random variables, X 1 , X 2 , …, XK. Let the probability of success for

each random variable be p. Let the random variable (^) 

K

i

i

B X

1

. Derive the expected value

and variance of B.

We know the E^ [^ Xi ] p and Var^ [^ Xi ]^ p (^1  p ).

E B EX X X EX EX EX p p p Kp K K

[ ] [  ... ] [ ] [ ]... [ ]  ... 

1 2 1 2

[ ] [ ... ] [ ] [ ] ... [ ] ( 1 ) ( 1 ) ... ( 1 )

1 2 1 2

Kp p

Var B Var X X X Var X Var X Var X p p p p p p K K

The variance of the sum of independent r.v.’s is the sum of the variances.

(b) Let Y = a + bX, where a and b are constants, and X and Y are random variables. Prove that

Var[Y] = b

2 Var[X]

[ ( ) ] [( ) ] [ ]

[ ] [( ) ] [(( ) ( )) ] [( ( )) ]

2 2 2 2 2

2 2 2

Eb X bE X bVar X

VarY E Y E a bX a b E a a b X

X X

Y X X

(c) Prove that the random variable

X

X

X

Z

 has mean 0 and variance 1.

[ ] 

^ 

 X

X X

X

X X

X

X

X

E X

X

EZ E 

[ ]

[ ]

2

2 2

^ 

X

X X X X

X

X

X Var X VarX

X

VarZ Var 

Problem 3

(a) Ferrucci’s deli at Exit 28 is running a promotion in which 10 percent of all vouchers they give

away upon walking in the store are worth a free sandwich (not really; it’s just my favorite deli).

Assume that a win on one voucher is independent of a win on another voucher. How many

vouchers would an individual need to collect in order to have a 50% chance of winning at least

one free sandwich?

Let A’ = no sandwiches won with X vouchers.

Then A = at least one sandwich won with X vouchers

The probability of a win with one voucher is .10, so the probability of not winning with

one voucher is .90. If you have X vouchers, the event that none of them wins is the

intersection of the events “no win” for each of the X vouchers. Given independence,

P(A’) = (9/10)

X .

So P(A) = 1 – P(A’) = 1 -.

X

. So we want .50 = 1 -.

X.

X = .5x ln .9 = ln .5x =

6.57. So the person needs 7 vouchers.

(b) In a survey of investment bankers, it was found that 46% had either traveled internationally

on business or could speak a foreign language. The probability that a banker who had not

traveled internationally could speak a foreign language was 10%. It was found that only 4% of

the bankers had traveled internationally and could speak a foreign language. What is the

probability that a banker who speaks a foreign language has not traveled internationally?

Let I = traveled internationally

F= speaks foreign language

Given P (^^ F ^ I ).^46 , P (^^ F | I ).^10 , P (^ F ^ I ).^04

The question asks for P (^ IF )?

PF

P F

P F

PF PI F

P F

P I F

P I F

So finding P(F) will finish the problem.

 P ( F  I ) P ( F ) P ( I ) P ( F  I ). 46  P ( F ) P ( I ). 04

Now we need P(I), in order to get P(F). And in order to get P(I), we need P(I’).

   PI

PI P I

PF I

PF I

P I

PI F

PF I

Thus, P (^ I )^1  P ( I )^1 .^6 .^40

Substituting into equation, P (^ F ).^46 .^04 .^40 .^10

Substituting into equation,.^60

. 10

PI  F 

Problem 4

Assume that the distance an inconsistent decathlete throws the javelin is distributed as a

uniform random variable with positive probability density between -10 and 75 meters.

(a) Write the complete formulas for and draw properly-labeled graphs of the probability density

function and cumulative probability density function for the distance of javelin throws.

     otherwise

x f x 0 ,

1 / 85 , 10 75 ( )

 

 

   

   (^) ^  

1 , 75

, 10 75

0 , 10 ( ) 8510

x

x

x F x bx a^ a x

(b) The women’s world record in the javelin throw was set on July 1, 2001 by Osleidys

Menendez of Cuba at 71.5 meters. What is the probability this inconsistent decathlete breaks

the record?

P ( 71. 5  x  75 ) F ( 75 ) F ( 71. 5 )[ 1 ( 71. 5  10 )/ 85 ][ 1 . 9588 ]. 0412

(c) Find the expected value of the distance squared.

 75 ( 1000 ) 1658. 333

85 * 3

1

85 3

1 [ ] ( ) ( )

3

75

10

(^753)

10

85

(^2 )     

   

 

 

 

  

x E X x f xdx x f xdx x dx

b

a

(d) Find the variance of the distance.

  1. 083

2

10 75

  1. 333

2

[ ] [ ] 1658. 333

2 2

2 2  

 

    ^ 

 

     

a b Var X E X X

F(x)

-10 75 x

1

0

f(x)

x

-10 75

1/