~Class meeting #4




~Z-Scores: (Standard Scores)

~Used primarily for comparing scores from different data sets relative to their means.
~gives the number of standard deviations a given score is from its mean.

~Ex: if the mean of a data set is 80 & the S=5, then a Z-score of +1 would be 85

~Ex: Student A got 85 on a test whose mean=79 & s = 8. Student B got 74 on a test whose mean=70 s = 5. Who did better?  Answer: compute both Z-scores & the HIGHER would be the better score relative to its test.

~Formula:   Z = (x - m ) / S, where x is a score, m is the sample mean, S is the sample standard deviation

        Z = (x-m)/s, where x is a score, m is the population mean, s is the population standard deviation

Z = (85-79)/8 = 0.75,  Z = (74-70)/5 = 0.80, so, student B did better.


~Unusual values: more than 2 standard deviations away from the mean (Z>2 or Z<-2)

~Ex: All women in pro basketball have a mean height of 63.6" (m) & a standard deviation of 2.5" (s), Rebecca Lobo is 76" tall. Is she considered unusually tall for her sport?

Compute her Z-score:   Z = (76-63.6)/2.5 = 4.96 (very much so!)

~Quartiles: dividing a data set into 4 quarters.               _____._____._____._____

~the data points, L, Q1, Q2, Q3, H  are known as the 5 number summary & are important to known for the construction of Box-plots.  Q2 is also known as the median or P50 (later)

~How to find them:
1) Enter the data in a list. (the TI-83 will give you these)
2) Find Q2 (median) (50% of the data is below Q2 & 50% is above Q2)
3) For Q1, find the median for all scores L to Q2 (separates the bottom 25% from the top 75%)
4) For Q3, find the median for all scores Q2 to H (separates the bottom 75% from the top 25%)

~Note: when finding Q1 & Q3, use the scores in the data set.

~Ex: take the data set 2,5,3,4,7,0,11,2,3,8.  
       Sort: 0,2,2,3,3,4,5,7,8,11, n=10,
       x = 3.5 = Q2 = median
       Q1 = median of all scores from 0 to 3.5 = 2 = Q1
       Q3 = median of all scores from 3.5 to 11 = 7

~Percentiles: (used extensively in educational testing. Usually for large amounts of data)

~dividing or separating the data into 100 sections.
  The symbol for the k th percentile is Pk.

~Note:  Q1 = P25,  Q2 = P50, Q3 = P75.  Finding these scores is a bit tricky, so be careful.

~How to find them:
1) Sort the data from low to high. The TI-83 will do this for you after you enter them in a list.
2) To find the k th percentile, find its Location.
Take k% of n (# of scores) & look at the result. If the result is a whole  #, average the score in this position with the next one. If the result is a decimal, round up & take the score in that position.

~let’s check Q1
& Q3 from our last example. We have n=10, so find P25 & P75.

~For P25, take 25% of 10 = 2.5. Since the result is a decimal, round up to 3 & take the 3rd score (2).

~For P75, take 75% of 10 = 7.5. Since the result is a decimal, round up to 8 & take the 8th score (7).

~Ex: For the data set, 34,39,63,64,67,70,75,76,81,82,84,85,86,88,89,89,90,96,96,100
       Find P25, P33, P50, P75, P80

~For P25, take 25% of 20 = 5, since the result is a whole #, average the scores in the 5Th & 6Th place. We get, (67+70)/2 = 68.5 = Q1

~For P33, take 33% of 20 = 6.6, since the result is a decimal, round up to 7 & take the 7th score (75)

~For P50, take 50% of 20 = 10, since the result is a whole #, average the scores in the 10th & 11th place. We get, (82+84)/2 = 83 = Q2

~For P75, take 75% of 20 = 15, since the result is a whole #, average the scores in the 15th & 16th place. We get, (89+89)/2 = 89 = Q3

~For P80, take 80% of 20 = 16, since the result is a whole #, average the scores in the 16th & 17th place. We get, (89+90)/2 = 89.5

~Finding a percentile given a score: (This is going in reverse direction)

~Just find the number of scores BELOW the given score & convert it to a percent.

~Ex: In the above data set, the score of 86 is in what percentile?

~find the number of scores below 86, which is 12. So, 12/20 = .60 = 60%, so, 86 is in the 60th percentile (it’s also in the 61th percentile).
So, this might account for a few discrepancies you might encounter.

~Note: When your data set is relatively small (less than 100), a given score could be in more than one percentile. The ideal situation would be 100 scores, then each score would represent a different percentile. For very large data sets (usually the case), many scores are in a given
percentile.