Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

The Fantasy Baseball Project - Applied Statistics | MATH 170, Study Guides, Projects, Research of Statistics

Material Type: Project; Professor: Best; Class: Applied Statistics; Subject: Mathematics; University: Western Carolina University; Term: Fall 2006;

Typology: Study Guides, Projects, Research

Pre 2010

Uploaded on 08/19/2009

koofers-user-t7u
koofers-user-t7u 🇺🇸

2

(1)

10 documents

1 / 4

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
MATH 170 - Applied Statistics
The Fantasy Baseball Project
DUE by Friday, December 8th, 2006
Project Description:
From a pool of 25 Major League Baseball (MLB) players, you will choose a ten-player team. Your choice of team
will be based on the statistical analysis of several categories. You can find the pool of 25 players in Appendix
A. These 25 players are a simple random sample of all players in MLB with at least 350 at-bats during the 2006
season.
This project consists of ten different parts with two different frames of mind.
Single Minded Criteria: Parts 1-9 require a statistical analysis of each of the nine “BEST TEAM
CHARACTERISTICS” given in the list below. For each characteristic (part), you will choose what you
consider to be the “best” ten-player team from our list of twenty-five eligible players, in terms of that single
criteria. As each of the first nine parts focuses on a specific characteristic, your choice of optimal ten-player
teams may likely change from part to part.
Universal/Multi- Criteria: Part 10 of the project has a more universal focus. This where you are asked
to choose your fantasy team. By this we mean that you want to determine your ten-player team (again,
out of the twenty-five eligible players) that will rank the best in the class with respect to all of the “best
team characteristics”. Once you have determined your team members, you are to compute the values of
each of the nine characteristic for this team. In the grading process, your fantasy team will be compared
to the fantasy teams from the rest of the class. The fantasy teams will be ranked in the following manner:
Suppose there are ndifferent values in the class for a given “best team characteristic”. If your fantasy
team is the best for this characteristic, you will get npoints; if your team is second best, you will get n1
points; if your team is the worst, you will get 1 point. This will be done for each of the nine best team
characteristics. You win if you have the highest cumulative number of points in the class.
BEST TEAM CHARACTERISTICS
1. Most runs batted in per player
2. Fewest strike outs per player
3. Most home runs per player
4. Highest batting average per player
5. Least variation in stolen bases per player
6. Strongest linear relationship between number of runs and number of home runs
7. Smallest probability that a randomly chosen player has less than ten stolen bases
8. Most narrow 95%-confidence interval for the mean batting average in MLB
9. Smallest p-value for the hypothesis test: H0:πhighAVG =.5 vs. HA:πhighAVG > .5
pf3
pf4

Partial preview of the text

Download The Fantasy Baseball Project - Applied Statistics | MATH 170 and more Study Guides, Projects, Research Statistics in PDF only on Docsity!

MATH 170 - Applied Statistics The Fantasy Baseball Project DUE by Friday, December 8th, 2006

Project Description: From a pool of 25 Major League Baseball (MLB) players, you will choose a ten-player team. Your choice of team will be based on the statistical analysis of several categories. You can find the pool of 25 players in Appendix A. These 25 players are a simple random sample of all players in MLB with at least 350 at-bats during the 2006 season.

This project consists of ten different parts with two different frames of mind.

  • Single Minded Criteria: Parts 1-9 require a statistical analysis of each of the nine “BEST TEAM CHARACTERISTICS” given in the list below. For each characteristic (part), you will choose what you consider to be the “best” ten-player team from our list of twenty-five eligible players, in terms of that single criteria. As each of the first nine parts focuses on a specific characteristic, your choice of optimal ten-player teams may likely change from part to part.
  • Universal/Multi- Criteria: Part 10 of the project has a more universal focus. This where you are asked to choose your fantasy team. By this we mean that you want to determine your ten-player team (again, out of the twenty-five eligible players) that will rank the best in the class with respect to all of the “best team characteristics”. Once you have determined your team members, you are to compute the values of each of the nine characteristic for this team. In the grading process, your fantasy team will be compared to the fantasy teams from the rest of the class. The fantasy teams will be ranked in the following manner:

Suppose there are n different values in the class for a given “best team characteristic”. If your fantasy team is the best for this characteristic, you will get n points; if your team is second best, you will get n − 1 points; if your team is the worst, you will get 1 point. This will be done for each of the nine best team characteristics. You win if you have the highest cumulative number of points in the class.

BEST TEAM CHARACTERISTICS

  1. Most runs batted in per player
  2. Fewest strike outs per player
  3. Most home runs per player
  4. Highest batting average per player
  5. Least variation in stolen bases per player
  6. Strongest linear relationship between number of runs and number of home runs
  7. Smallest probability that a randomly chosen player has less than ten stolen bases
  8. Most narrow 95%-confidence interval for the mean batting average in MLB
  9. Smallest p-value for the hypothesis test: H 0 : πhighAVG = .5 vs. HA : πhighAVG >. 5

Single Minded Criteria:

Part 1: Most Runs Batted In Per Player

Description: A player’s number of runs batted in appears in the column labeled RBI. Two common interpretations of the word “most” are: the highest mean of your ten values and the highest median of your ten values. For this part we will use the highest mean interpretation. Analysis: Make a graphical display of the RBI data (all 25 points). You can choose from: histogram (with equal or unequal class-interval lengths), stem-and-leaf plot, boxplot, and dotplot. If you choose to make a histogram, include a frequency table. What ten-man team should you choose if “most” means “highest mean?” Is there a single answer to this question? Explain.

Part 2: Fewest strike outs per player

Description: The number of times a player struck out is in the column labeled SO. Two common inter- pretations of the word “fewest” are: the smallest mean of your ten values and the smallest median of your ten values. For this part we will use the smallest mean interpretation. Analysis: Make a graphical display of the SO data (all 25 points). Your choice here must be different than your choice in Part 1. What ten-man team should you choose if “fewest” means “smallest mean?”

Part 3: Most home runs per player

Description: A player’s number of home runs is in the column labeled HR. For this part we will use the “highest median” interpretation of the word “most.” Analysis: Make a graphical display of the HR data (all 25 points). Your choice here must be different than your choices in Parts 1 and 2. What ten-man team should you choose if “highest” means “highest median?” Is there a single answer to this question? Explain.

Part 4: Highest batting average per player

Description: A player’s batting average is computed by dividing his number of hits by his number of at-bats (H/AB). For this part we will use the “highest median” interpretation of the word “highest.” Analysis: This data does not appear in the data set directly, so insert an “AVG” column. Make a graphical display of the batting average data (all 25 points). Your choice here must be different than your choices in Parts 1, 2, and 3. What ten-man team should you choose if “highest” means “highest median?”

Part 5: Least variation in stolen bases per player

Description: The number of bases that a player stole is in the column labeled SB. Three common statistical interpretations of the word “variation” are: the range, the interquartile range, and the standard deviation of your ten values. Analysis: Make a dotplot of the SB data (all 25 points). What ten-man team should you choose if “variation” means “range?” What if “variation” means “interquartile range?” What if “variation” means “standard deviation?” Use your dotplot to help explain why you chose each team.

Part 6: Strongest linear relationship between number of runs and number of home runs

Description: You want a team for which the number of runs (R) and the number of home runs (HR) have the strongest linear relationship. Analysis: Make a scatterplot of R vs. HR. (Place R on the horizontal axis.) What ten-player team appears to have a very strong linear relationship between these variables? Identify your team on the scatterplot. For this team, what is the correlation coefficient?

A Player Data

The following data represents a random sample of the 221 Major League Baseball (MLB) players with over 350 at-bats during the 2006 regular season. [2]

Player Number Name AB R H HR RBI SO SB 1 J Pierre 699 87 204 3 40 38 58 2 M Teixeira 628 99 177 33 110 128 2 3 R Sexson 591 75 156 34 107 154 1 4 L Castillo 584 84 173 3 49 58 25 5 M Lowell 573 79 163 20 80 61 2 6 A Gonzalez 570 83 173 24 82 113 0 7 B Roberts 563 85 161 10 55 66 36 8 T Iguchi 555 97 156 18 67 110 11 9 C Guillen 543 100 174 19 85 87 20 10 B Hall 537 101 145 35 85 162 8 11 S Green 530 73 147 15 66 82 4 12 W Taveras 529 83 147 1 30 88 33 13 C Delgado 524 89 139 38 114 120 0 14 J Mauer 521 86 181 13 84 54 8 15 M DeRosa 520 78 154 13 74 102 4 16 R Hernandez 501 66 138 23 91 79 1 17 D DeJesus 491 83 145 8 56 70 6 18 E Chavez 485 74 117 22 72 100 3 19 G Jenkins 484 62 131 17 70 129 4 20 J Uribe 463 53 109 21 71 82 1 21 M Cabrera 460 75 129 7 50 59 12 22 I Kinsler 423 65 121 14 55 64 11 23 Y Molina 417 29 90 6 49 41 1 24 J Gomes 385 53 83 20 59 116 1 25 M Izturis 352 64 103 5 44 35 14

References

[1] Calculations on a Curveball: The Many Figures of Baseball. Turner Le@rning: The Educational Division of CNN and Turner Broadcasting http://www.turnerlearning.com/efts/bball/numbers.htm.

[2] ESPN Major League Baseball Data Web Site. ESPN (MLB) Player Batting Stats: 2006 at sports.espn.go.com/mlb/stats/batting?league=mlb.

[3] Fantasy Baseball. Wikipedia, The Free Encyclopedia: http://en.wikipedia.org/wiki/Fantasy baseball.

[4] The College Board Advanced Placement Program: Statistics Course De- scription, May 2005 - May 2006. The College Board Web Site, http://apcentral.collegeboard.com/repository/statistics cd 0506 4328.pdf, 2004.

[5] Roxy Peck, Chris Olsen, and Jay Devore. Introduction to Statistics and Data Analysis, 2 nd^ Edition. Thomson Brooks/Cole, 2005.