{"id":412,"date":"2020-04-10T17:02:35","date_gmt":"2020-04-10T21:02:35","guid":{"rendered":"http:\/\/pressbooks.library.upei.ca\/montelpare\/?post_type=chapter&#038;p=412"},"modified":"2021-05-19T06:16:48","modified_gmt":"2021-05-19T10:16:48","slug":"measures-of-central-tendency-variance-and-confidence-intervals","status":"publish","type":"chapter","link":"https:\/\/pressbooks.library.upei.ca\/montelpare\/chapter\/measures-of-central-tendency-variance-and-confidence-intervals\/","title":{"raw":"Measures of Central Tendency","rendered":"Measures of Central Tendency"},"content":{"raw":"<h1>PART 1: Measures of Central Tendency<\/h1>\r\nThe most common measure of central tendency is the <strong>mean <\/strong>or <strong>average <\/strong>score. The mean is a calculated score that is intended to represent all of the scores in the distribution (set of scores).\r\n\r\nThe formula for the mean of a sample is shown here:\r\n\r\n[latex]{\\overline{x}} = <span>\\Sigma{(x_i)<\/span>\\over{n}}[\/latex]<code><span class=\"pun\"><\/span><\/code>\r\n\r\nWhere:\r\n<ul>\r\n \t<li>[latex]{\\overline{x}}[\/latex] refers to the sample mean<\/li>\r\n \t<li>[latex]<span>\\Sigma{(x_i)<\/span>} refers to the sum of all the scores<\/li>\r\n \t<li>i refers to the \u201cith\u201d case within the distribution<\/li>\r\n \t<li>n refers to all of the cases within the distribution.<\/li>\r\n<\/ul>\r\nTo calculate the mean for a continuous variable, add up all of the values and divide the sum of values by the number of values. Below is a set of blood glucose measures for 5 patients. These data are represented in millimoles per litre (mmol\/L). 
P<sub>n<\/sub> represents the nominal value label for each patient, so that P1 is patient 1.\r\n\r\nP<sub>1<\/sub> 4.2 mmol\/L, P<sub>2<\/sub> 5.6 mmol\/L, P<sub>3<\/sub> 7.9 mmol\/L, P<sub>4<\/sub> 10.2 mmol\/L, P<sub>5<\/sub> 7.5 mmol\/L,\r\n\r\nFollow these steps to calculate the mean:\r\n<ul>\r\n \t<li>First add the values together: 4.2 + 5.6 + 7.9 + 10.2 + 7.5 = 35.4.<\/li>\r\n \t<li>Next, divide by the number of values (to produce the average): 35.4\/5 = 7.08 mmol\/L<\/li>\r\n<\/ul>\r\nWe can also use SAS to compute the mean for a set of scores. Two specific SAS programs that process measures of central tendency are PROC MEANS, and PROC UNIVARIATE. Each of these programs was designed to produce descriptive statistics for a sample of scores. Below are the SAS commands to compute the mean for a set of 10 resting heart rate scores. In this first program we used the SAS procedural command PROC MEANS to compute three basic estimates: the mean, the standard deviation and the minimum\/maximum scores for the sample dataset of 10 numbers.\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">SAS PROC MEANS to Produce Descriptive Statistics for a Sample of 10 Numbers<\/header>\r\n<div class=\"textbox__content\">DATA MN_HR;\r\nINPUT ID SCORE @@;\r\nDATALINES;\r\n01 48 02 54 03 66 04 72 05 56 06 68 07 48 08 67 09 55 10 84\r\n;\r\nPROC MEANS DATA=MN_HR;\r\nVAR SCORE;\r\nRUN;<\/div>\r\n<\/div>\r\nNotice in the code written above, the semi-colon (;) is placed on a separate line below the set of scores.\r\n\r\nWhile PROC MEANS, in its simplest form (without options) provides three basic estimates that describe estimates within a distribution, the SAS procedural command PROC UNIVARIATE not only computes the mean but also creates the Basic Statistical Measures Table which provides an entire summary of descriptive statistics. 
The output generated by the SAS program above \u2013 using the PROC MEANS statement without options \u2013 produced a table of summary estimates that included the mean and standard deviation as well as the minimum and maximum values for the dataset.\r\n\r\n<strong>SAS Output from the MEANS Procedure: Variable of interest was Heart Rate<\/strong>\r\n<div align=\"center\">\r\n<table>\r\n<thead>\r\n<tr>\r\n<td style=\"width: 187.05px\"><strong>N<\/strong><\/td>\r\n<td style=\"width: 51.85px\"><strong>Mean<\/strong><\/td>\r\n<td style=\"width: 67.05px\"><strong>Std Dev<\/strong><\/td>\r\n<td style=\"width: 82.25px\"><strong>Minimum<\/strong><\/td>\r\n<td style=\"width: 83.85px\"><strong>Maximum<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 187.05px\">10<\/td>\r\n<td style=\"width: 51.85px\">61.80<\/td>\r\n<td style=\"width: 67.05px\">11.56<\/td>\r\n<td style=\"width: 82.25px\">48.00<\/td>\r\n<td style=\"width: 83.85px\">84.00<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\nWhen we call the PROC UNIVARIATE procedure of SAS, the output is a more complete table of summaries that include estimates of centrality but also the moments, measures of variance, and the tests of the location of the mean, as shown below.\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">SAS PROC UNIVARIATE to Produce Descriptive Statistics for a Sample of 10 Numbers<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nPROC UNIVARIATE DATA=MN_HR;\r\nVAR SCORE;\r\nRUN;\r\n\r\n<\/div>\r\n<\/div>\r\n<strong>The UNIVARIATE Procedure -- <\/strong><strong>Variable: SCORE<\/strong>\r\n<div align=\"center\">\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>\u00a0MOMENTS\u00a0<\/strong><\/td>\r\n<td style=\"width: 76.65px\"><\/td>\r\n<td style=\"width: 126.25px\"><strong>\u00a0<\/strong><\/td>\r\n<td style=\"width: 73.45px\"><\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 
107.85px\"><strong>N<\/strong><\/td>\r\n<td style=\"width: 76.65px\">10<\/td>\r\n<td style=\"width: 126.25px\"><strong>Sum Weights<\/strong><\/td>\r\n<td style=\"width: 73.45px\">10<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Mean<\/strong><\/td>\r\n<td style=\"width: 76.65px\">61.8<\/td>\r\n<td style=\"width: 126.25px\"><strong>Sum Observations<\/strong><\/td>\r\n<td style=\"width: 73.45px\">618<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Std Deviation<\/strong><\/td>\r\n<td style=\"width: 76.65px\">11.5547008<\/td>\r\n<td style=\"width: 126.25px\"><strong>Variance<\/strong><\/td>\r\n<td style=\"width: 73.45px\">133.511111<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Skewness<\/strong><\/td>\r\n<td style=\"width: 76.65px\">0.55954538<\/td>\r\n<td style=\"width: 126.25px\"><strong>Kurtosis<\/strong><\/td>\r\n<td style=\"width: 73.45px\">-0.2284272<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Uncorrected SS<\/strong><\/td>\r\n<td style=\"width: 76.65px\">39394<\/td>\r\n<td style=\"width: 126.25px\"><strong>Corrected SS<\/strong><\/td>\r\n<td style=\"width: 73.45px\">1201.6<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Coeff Variation<\/strong><\/td>\r\n<td style=\"width: 76.65px\">18.6969269<\/td>\r\n<td style=\"width: 126.25px\"><strong>Std Error Mean<\/strong><\/td>\r\n<td style=\"width: 73.45px\">3.65391723<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div align=\"center\">\r\n<table>\r\n<thead>\r\n<tr>\r\n<td><strong>Tests\u00a0for\u00a0Location:\u00a0Mu0=0<\/strong><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Test<\/strong><\/td>\r\n<td><strong>STATISTIC<\/strong><\/td>\r\n<td><strong>ESTIMATE\r\n<\/strong><\/td>\r\n<td><\/td>\r\n<td><strong>p Value<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td><strong>Student's 
t<\/strong><\/td>\r\n<td><strong>t<\/strong><\/td>\r\n<td>16.91336<\/td>\r\n<td><strong>Pr &gt; |t|<\/strong><\/td>\r\n<td>&lt;.0001<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Sign<\/strong><\/td>\r\n<td><strong>M<\/strong><\/td>\r\n<td>5<\/td>\r\n<td><strong>Pr &gt;= |M|<\/strong><\/td>\r\n<td>0.0020<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Signed Rank<\/strong><\/td>\r\n<td><strong>S<\/strong><\/td>\r\n<td>27.5<\/td>\r\n<td><strong>Pr &gt;= |S|<\/strong><\/td>\r\n<td>0.0020<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n\r\n<hr \/>\r\n\r\n<h2>Comparing the Mean for a Sample to the Expected Mean for a Population<\/h2>\r\nIn the output from PROC UNIVARIATE, SAS includes a table in which the mean for the variable SCORE is compared to a hypothesized population mean, Mu<sub>0<\/sub>. Unless another value is specified, SAS sets this hypothesized mean to 0.\u00a0 In the SAS table shown above, entitled <strong>Tests\u00a0for\u00a0Location:\u00a0Mu0=0<\/strong>, the comparison of the sample mean ([latex]{\\overline{x}}[\/latex]) to the hypothesized population mean ([latex]{\\mu}[\/latex]) is evaluated with the Student\u2019s t-test.\r\n\r\nThe results presented in the table above show that the Student\u2019s t-statistic is 16.91 and the probability associated with this estimate is &lt;0.0001. 
Together these values indicate that the observed sample mean is significantly different from the hypothesized mean for the population (set at Mu<sub>0<\/sub>=0) from which the sample was drawn.\r\n\r\nHowever, what if we wanted to establish a suggested value for the population mean that is not 0, but that is based on a value reported in the literature?\u00a0 In this case, we could assign a suggested value to the population mean and then compare the observed mean for the sample to the expected value for the population.\u00a0 In the following code, we test this notion.\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Assign a suggested value to the population mean<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nPROC TTEST DATA=MN_HR H0=54\r\nPLOTS(SHOWH0)\r\nALPHA=0.05;\r\nVAR SCORE;\r\nRUN;\r\n\r\n<\/div>\r\n<\/div>\r\nThe SAS output is given below. The results indicate that the average score for the sample ([latex]{\\overline{x}}[\/latex] = 61.80) is not significantly different from the expected score ([latex]{\\mu}[\/latex] = 54) at the 0.05 level of significance (p = 0.0615). 
Notice that, in addition to the table of output, SAS also includes a graph illustrating the shape of the distribution and the comparison of the sample estimate to the expected population estimate of centrality.\r\n<div align=\"center\">\r\n<table>\r\n<thead>\r\n<tr>\r\n<td><strong>The t-test Procedure<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>DF<\/strong><\/td>\r\n<td><strong>t\u00a0Value<\/strong><\/td>\r\n<td><strong>Pr\u00a0&gt;\u00a0|t|<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>9<\/td>\r\n<td>2.13<\/td>\r\n<td>0.0615<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div style=\"text-align: center\" align=\"center\">\r\n<table>\r\n<thead>\r\n<tr>\r\n<td><strong>Parameter estimates<\/strong><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<\/tr>\r\n<tr class=\"shaded\">\r\n<td style=\"text-align: center\"><strong>Mean<\/strong><\/td>\r\n<td style=\"text-align: center\"><strong>95% CL Mean<\/strong><\/td>\r\n<td style=\"text-align: center\"><\/td>\r\n<\/tr>\r\n<tr>\r\n<td>61.8000<\/td>\r\n<td>Lower limit: 53.5343<\/td>\r\n<td>Upper limit: 70.0657<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<\/table>\r\nBecause the 95% confidence interval for the sample mean (53.53 to 70.07) includes the value of the population mean, which we set a priori as 54, no significant difference is observed between the expected value and the observed value. This estimate is illustrated in the following graph.\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/04\/ttest1-300x225.png\" alt=\"\" class=\"aligncenter wp-image-843\" width=\"479\" height=\"359\" \/>\r\n\r\n<hr \/>\r\n\r\n<h2 style=\"text-align: left\">Calculate the Mean for A Frequency Distribution<\/h2>\r\nIn the following example, we compute the mean for a frequency distribution. 
The formula to compute the mean of a frequency distribution is shown here:\r\n\r\n[latex]{\\overline{x}} = {\\Sigma{f x_i}\\over{n}}[\/latex]\r\n\r\nWhere:\r\n<ul>\r\n \t<li>f refers to the frequency in each interval<\/li>\r\n \t<li>x<sub>i<\/sub> refers to the mid-point of the \u201cith\u201d interval<\/li>\r\n \t<li>i refers to the \u201cith\u201d interval within the distribution<\/li>\r\n \t<li>n refers to the total number of cases within the distribution.<\/li>\r\n<\/ul>\r\nBelow is the frequency distribution table for the heights of 200 individuals. The data represent heights recorded in centimetres and organized into seven categories. The SAS code to compute the mean for this set of data is shown below the table. Notice that, in the SAS code, the table is reduced to two variables: the mid-point of each category, represented by the variable GRPMDPT, and the number of individuals whose heights fall within that category, represented by the variable COUNTS.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Column 1\r\n\r\ncell boundaries<\/td>\r\n<td>Column 2 frequency (f)<\/td>\r\n<td>Column 3\r\n\r\ncell mid-point<\/td>\r\n<td>Column 4\r\n\r\n(f) x cell midpoint<\/td>\r\n<td>Column 5\r\n\r\n(col 4 \u00f7 n)<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>158.5 \u2013 161.5<\/td>\r\n<td>4<\/td>\r\n<td>160<\/td>\r\n<td>4 x 160 = 640<\/td>\r\n<td>640\/200 = 3.20<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>161.5 \u2013 164.5<\/td>\r\n<td>12<\/td>\r\n<td>163<\/td>\r\n<td>12 x 163 = 1956<\/td>\r\n<td>1956\/200 = 9.78<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>164.5 \u2013 167.5<\/td>\r\n<td>44<\/td>\r\n<td>166<\/td>\r\n<td>44 x 166 = 7304<\/td>\r\n<td>7304\/200 = 36.52<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>167.5 \u2013 170.5<\/td>\r\n<td>64<\/td>\r\n<td>169<\/td>\r\n<td>64 x 169 = 10816<\/td>\r\n<td>10816\/200 = 54.08<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>170.5 \u2013 173.5<\/td>\r\n<td>56<\/td>\r\n<td>172<\/td>\r\n<td>56 x 172 = 9632<\/td>\r\n<td>9632\/200 = 48.16<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>173.5 \u2013 
176.5<\/td>\r\n<td>16<\/td>\r\n<td>175<\/td>\r\n<td>16 x 175 = 2800<\/td>\r\n<td>2800\/200 = 14.00<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>176.5 \u2013 179.5<\/td>\r\n<td>4<\/td>\r\n<td>178<\/td>\r\n<td>4 x 178 = 712<\/td>\r\n<td>712\/200 = 3.56<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td>[latex]{\\overline{x}} = {\\Sigma{f x_i}\\over{n}}[\/latex]<\/td>\r\n<td>[latex]{\\overline{x}} = {33860\\over 200}[\/latex]<\/td>\r\n<td>\u00a0= 169.3<\/td>\r\n<td>The [latex]{\\overline{x}}[\/latex] is the sum of column 5<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">The SAS code to compute the mean for data in the table above<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nDATA FREQMN;\r\nINPUT GRPMDPT COUNTS @@;\r\n\/* COMPUTE THE CROSS PRODUCT USING GROUP MIDPOINT X CELL FREQUENCY *\/\r\nCRSPRDCT= GRPMDPT*COUNTS;\r\n\/* COMPUTE RATIO FOR THE CROSS PRODUCT BY DIVIDING BY THE TOTAL NUMBER OF CASES (N=200) *\/\r\nXP_RATIO=CRSPRDCT\/200;\r\nLABEL GRPMDPT = 'GROUP MIDPOINT'\r\nCOUNTS = 'NUMBER OF CASES PER CELL'\r\nCRSPRDCT = 'CROSS PRODUCT PER CELL'\r\nXP_RATIO = 'CROSS PRODUCT RATIO';\r\nDATALINES;\r\n160 4 163 12 166 44 169 64 172 56 175 16 178 4\r\n;\r\nPROC PRINT DATA=FREQMN;\r\nVAR GRPMDPT COUNTS CRSPRDCT XP_RATIO;\r\nSUM CRSPRDCT XP_RATIO;\r\nFOOTNOTE1 \"* THE MEAN IS PRODUCED AS THE SUM OF THE VARIABLE XP_RATIO\";\r\nFOOTNOTE2 \"** THE MEAN CAN ALSO BE CALCULATED FROM THE SUM OF THE VARIABLE CRSPRDCT \u00f7 200\";\r\nRUN;\r\n\r\n<\/div>\r\n<\/div>\r\nThe output generated by the SAS program above is the table of raw data presented in column form, including the sums of the columns used to compute the mean for the frequency distribution.\r\n<div 
align=\"center\">\r\n<table>\r\n<thead>\r\n<tr>\r\n<td><strong>Obs<\/strong><\/td>\r\n<td><strong>grpmdpt<\/strong><\/td>\r\n<td><strong>counts<\/strong><\/td>\r\n<td><strong>crsprdct<\/strong><\/td>\r\n<td><strong>xp_ratio<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td><strong>1<\/strong><\/td>\r\n<td>160<\/td>\r\n<td>4<\/td>\r\n<td>640<\/td>\r\n<td>3.20<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>2<\/strong><\/td>\r\n<td>163<\/td>\r\n<td>12<\/td>\r\n<td>1956<\/td>\r\n<td>9.78<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>3<\/strong><\/td>\r\n<td>166<\/td>\r\n<td>44<\/td>\r\n<td>7304<\/td>\r\n<td>36.52<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>4<\/strong><\/td>\r\n<td>169<\/td>\r\n<td>64<\/td>\r\n<td>10816<\/td>\r\n<td>54.08<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>5<\/strong><\/td>\r\n<td>172<\/td>\r\n<td>56<\/td>\r\n<td>9632<\/td>\r\n<td>48.16<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>6<\/strong><\/td>\r\n<td>175<\/td>\r\n<td>16<\/td>\r\n<td>2800<\/td>\r\n<td>14.00<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>7<\/strong><\/td>\r\n<td>178<\/td>\r\n<td>4<\/td>\r\n<td>712<\/td>\r\n<td>3.56<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td><strong>33860<\/strong><\/td>\r\n<td><strong>169.30<\/strong><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n* The mean is produced as the sum of the variable <strong><em>XP_RATIO<\/em><\/strong>\r\n\r\n** The mean can also be calculated from the sum of the variable <strong><em>crsprdct<\/em><\/strong> \u00f7 200\r\n\r\n<hr \/>\r\n\r\n<h2 style=\"text-align: left\">The Weighted Mean Score<\/h2>\r\nIn some situations, we may wish to combine means from several samples. Under such circumstances, we need to consider the sample size (or weight) of the distribution from which the means were drawn. 
By weighting each independent sample mean by the number of subjects in the sample from which it was drawn, we allow each mean to contribute to the combined mean of all samples in proportion to its sample size. The formula for a weighted mean from two samples is shown here:\r\n\r\n[latex]{\\overline{x}}={n_1\\times{\\overline{x_1}}+n_2\\times{\\overline{x_2}}\\over{n_1 + n_2}}[\/latex]\r\n\r\n<hr \/>\r\n\r\n<h2 style=\"text-align: left\">The Median Score<\/h2>\r\nThe median score is also a measure of central tendency, and it is defined as the middle score in a set of ordered scores.\u00a0 In the example below, we begin with a set of scores (an array) and then sort the scores from lowest to highest.\u00a0 Then we identify the number that is in the middle of the ordered set of scores, where half the numbers are above the identified middle score and half the numbers are below the identified middle score.\r\n\r\nExample: Median\r\n\r\nThe <em>median<\/em> is the middle score. Consider a set of 10 resting heart rate values. We put these readings in order of magnitude and then identify which value is in the middle:\r\n<ul>\r\n \t<li>57<\/li>\r\n \t<li>59<\/li>\r\n \t<li>59<\/li>\r\n \t<li>75<\/li>\r\n \t<li>78<\/li>\r\n \t<li>78<\/li>\r\n \t<li>85<\/li>\r\n \t<li>88<\/li>\r\n \t<li>88<\/li>\r\n \t<li>88<\/li>\r\n<\/ul>\r\nIn this case, we have an even number of values (<em>n<\/em> = 10) so we can calculate the average of the two values in the middle. 
It just so happens that they are the same value in this example (78) so the median is 78.\r\n\r\nThe following example uses an odd number of scores (<em>n<\/em> = 9):\r\n<ul>\r\n \t<li>initial array of scores: {12, 72, 56, 34, 35, 13, 36, 16, 67}<\/li>\r\n \t<li>sorted array of scores: {12, 13, 16, 34, 35, 36, 56, 67, 72}<\/li>\r\n \t<li>middle score of the sorted array: {12, 13, 16, 34, <strong>35<\/strong>, 36, 56, 67, 72}<\/li>\r\n<\/ul>\r\nNotice in the example above that, regardless of the actual scores, the middle score in the ordered set of scores is the median, which in this set is 35.\r\n\r\nWhen we have an even number of scores in our array, there is a special caveat to identifying the median score in the distribution (set of scores). When two scores share the middle position, we simply compute the average of the two middle scores and use that number as the median score.\u00a0 That is, we add the two middle scores together and divide by 2.\r\n<ul>\r\n \t<li>initial array of scores: {22, 32, 86, 44, 25, 13, 16, 18, 47, 11}<\/li>\r\n \t<li>sorted array of scores: {11, 13, 16, 18, 22, 25, 32, 44, 47, 86}<\/li>\r\n \t<li>computed median for the array: (22 + 25) \u00f7 2 = 23.5<\/li>\r\n<\/ul>\r\n\r\n<hr \/>\r\n\r\n<h2>The Mode Score<\/h2>\r\nThe mode score is the third measure of central tendency, and it is defined as the most frequently occurring score in a set of scores. 
In the example below, we simply count the number of scores that are the same within a set of scores, within an array or within a distribution.\r\n\r\nBelow are 10 resting heart rate values:\r\n\r\n78, 88, 57, 59, 75, 85, 88, 78, 59, 88\r\n\r\nThe mode is 88 because it appears most often.\r\n\r\nIn the following example of 16 scores, the number 2 occurs 3 times, but the number 27 occurs 4 times therefore we would identify 27 as the mode score.\r\n\r\n<strong><span style=\"color: #ff6600\">2<\/span>, <span style=\"color: #ff6600\">2<\/span>, <span style=\"color: #ff6600\">2<\/span>,<\/strong> 5, 6, 14, 15, 23, 26, <strong><span style=\"color: #0000ff\">27,<\/span> <span style=\"color: #0000ff\">27<\/span>, <span style=\"color: #0000ff\">27<\/span>, <span style=\"color: #0000ff\">27<\/span><\/strong>, 28, 37, 41\r\n\r\n<hr \/>\r\n\r\n&nbsp;\r\n\r\n<\/div>","rendered":"<h1>PART 1: Measures of Central Tendency<\/h1>\n<p>The most common measure of central tendency is the <strong>mean <\/strong>or <strong>average <\/strong>score. The mean is a calculated score that is intended to represent all of the scores in the distribution (set of scores).<\/p>\n<p>The formula for the mean of a sample is shown here:<\/p>\n<p>[latex]{\\overline{x}} = <span>\\Sigma{(x_i)<\/span>\\over{n}}[\/latex]<code><span class=\"pun\"><\/span><\/code><\/p>\n<p>Where:<\/p>\n<ul>\n<li>[latex]{\\overline{x}}[\/latex] refers to the sample mean<\/li>\n<li>[latex]<span>\\Sigma{(x_i)<\/span>} refers to the sum of all the scores<\/li>\n<li>i refers to the \u201cith\u201d case within the distribution<\/li>\n<li>n refers to all of the cases within the distribution.<\/li>\n<\/ul>\n<p>  To calculate the mean for a continuous variable, add up all of the values and divide the sum of values by the number of values. Below is a set of blood glucose measures for 5 patients. These data are represented in millimoles per litre (mmol\/L). 
P<sub>n<\/sub> represents the nominal value label for each patient, so that P1 is patient 1.    P<sub>1<\/sub> 4.2 mmol\/L, P<sub>2<\/sub> 5.6 mmol\/L, P<sub>3<\/sub> 7.9 mmol\/L, P<sub>4<\/sub> 10.2 mmol\/L, P<sub>5<\/sub> 7.5 mmol\/L,    Follow these steps to calculate the mean:  <\/p>\n<ul>\n<li>First add the values together: 4.2 + 5.6 + 7.9 + 10.2 + 7.5 = 35.4.<\/li>\n<li>Next, divide by the number of values (to produce the average): 35.4\/5 = 7.08 mmol\/L<\/li>\n<\/ul>\n<p>  We can also use SAS to compute the mean for a set of scores. Two specific SAS programs that process measures of central tendency are PROC MEANS, and PROC UNIVARIATE. Each of these programs was designed to produce descriptive statistics for a sample of scores. Below are the SAS commands to compute the mean for a set of 10 resting heart rate scores. In this first program we used the SAS procedural command PROC MEANS to compute three basic estimates: the mean, the standard deviation and the minimum\/maximum scores for the sample dataset of 10 numbers.  <\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">SAS PROC MEANS to Produce Descriptive Statistics for a Sample of 10 Numbers<\/header>\n<div class=\"textbox__content\">DATA MN_HR;  INPUT ID SCORE @@;  DATALINES;  01 48 02 54 03 66 04 72 05 56 06 68 07 48 08 67 09 55 10 84  ;  PROC MEANS DATA=MN_HR;  VAR SCORE;  RUN;<\/div>\n<\/p><\/div>\n<p>  Notice in the code written above, the semi-colon (;) is placed on a separate line below the set of scores.    While PROC MEANS, in its simplest form (without options) provides three basic estimates that describe estimates within a distribution, the SAS procedural command PROC UNIVARIATE not only computes the mean but also creates the Basic Statistical Measures Table which provides an entire summary of descriptive statistics. 
The output generated by the SAS program above \u2013 using the PROC MEANS statement without options \u2013 produced a table of summary estimates that included the mean and standard deviation as well as the minimum and maximum values for the dataset.    <strong>SAS Output from the MEANS Procedure: Variable of interest was Heart Rate<\/strong>  <\/p>\n<div style=\"margin: auto;\">\n<table>\n<thead>\n<tr>\n<td style=\"width: 187.05px\"><strong>N<\/strong><\/td>\n<td style=\"width: 51.85px\"><strong>Mean<\/strong><\/td>\n<td style=\"width: 67.05px\"><strong>Std Dev<\/strong><\/td>\n<td style=\"width: 82.25px\"><strong>Minimum<\/strong><\/td>\n<td style=\"width: 83.85px\"><strong>Maximum<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"width: 187.05px\">10<\/td>\n<td style=\"width: 51.85px\">61.80<\/td>\n<td style=\"width: 67.05px\">11.56<\/td>\n<td style=\"width: 82.25px\">48.00<\/td>\n<td style=\"width: 83.85px\">84.00<\/td>\n<\/tr>\n<\/tbody>\n<\/table><\/div>\n<p>  When we call the PROC UNIVARIATE procedure of SAS, the output is a more complete table of summaries that include estimates of centrality but also the moments, measures of variance, and the tests of the location of the mean, as shown below.  
<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">SAS PROC UNIVARIATE to Produce Descriptive Statistics for a Sample of 10 Numbers<\/p>\n<\/header>\n<div class=\"textbox__content\">    PROC UNIVARIATE DATA=MN_HR;  VAR SCORE;  RUN;    <\/div>\n<\/p><\/div>\n<p>  <strong>The UNIVARIATE Procedure -- <\/strong><strong>Variable: SCORE<\/strong>  <\/p>\n<div style=\"margin: auto;\">\n<table>\n<tbody>\n<tr>\n<td style=\"width: 107.85px\"><strong>\u00a0MOMENTS\u00a0<\/strong><\/td>\n<td style=\"width: 76.65px\"><\/td>\n<td style=\"width: 126.25px\"><strong>\u00a0<\/strong><\/td>\n<td style=\"width: 73.45px\"><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>N<\/strong><\/td>\n<td style=\"width: 76.65px\">10<\/td>\n<td style=\"width: 126.25px\"><strong>Sum Weights<\/strong><\/td>\n<td style=\"width: 73.45px\">10<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Mean<\/strong><\/td>\n<td style=\"width: 76.65px\">61.8<\/td>\n<td style=\"width: 126.25px\"><strong>Sum Observations<\/strong><\/td>\n<td style=\"width: 73.45px\">618<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Std Deviation<\/strong><\/td>\n<td style=\"width: 76.65px\">11.5547008<\/td>\n<td style=\"width: 126.25px\"><strong>Variance<\/strong><\/td>\n<td style=\"width: 73.45px\">133.511111<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Skewness<\/strong><\/td>\n<td style=\"width: 76.65px\">0.55954538<\/td>\n<td style=\"width: 126.25px\"><strong>Kurtosis<\/strong><\/td>\n<td style=\"width: 73.45px\">-0.2284272<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Uncorrected SS<\/strong><\/td>\n<td style=\"width: 76.65px\">39394<\/td>\n<td style=\"width: 126.25px\"><strong>Corrected SS<\/strong><\/td>\n<td style=\"width: 73.45px\">1201.6<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Coeff Variation<\/strong><\/td>\n<td style=\"width: 76.65px\">18.6969269<\/td>\n<td style=\"width: 
126.25px\"><strong>Std Error Mean<\/strong><\/td>\n<td style=\"width: 73.45px\">3.65391723<\/td>\n<\/tr>\n<\/tbody>\n<\/table><\/div>\n<div style=\"margin: auto;\">\n<table>\n<thead>\n<tr>\n<td><strong>Tests\u00a0for\u00a0Location:\u00a0Mu0=0<\/strong><\/td>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td><strong>Test<\/strong><\/td>\n<td><strong>STATISTIC<\/strong><\/td>\n<td><strong>ESTIMATE  <\/strong><\/td>\n<td><\/td>\n<td><strong>p Value<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Student's t<\/strong><\/td>\n<td><strong>t<\/strong><\/td>\n<td>16.91336<\/td>\n<td><strong>Pr &gt; |t|<\/strong><\/td>\n<td>&lt;.0001<\/td>\n<\/tr>\n<tr>\n<td><strong>Sign<\/strong><\/td>\n<td><strong>M<\/strong><\/td>\n<td>5<\/td>\n<td><strong>Pr &gt;= |M|<\/strong><\/td>\n<td>0.0020<\/td>\n<\/tr>\n<tr>\n<td><strong>Signed Rank<\/strong><\/td>\n<td><strong>S<\/strong><\/td>\n<td>27.5<\/td>\n<td><strong>Pr &gt;= |S|<\/strong><\/td>\n<td>0.0020<\/td>\n<\/tr>\n<\/tbody>\n<\/table><\/div>\n<hr \/>\n<h2>Comparing the Mean for a Sample to the Expected Mean for a Population<\/h2>\n<p>In the output from PROC UNIVARIATE, SAS includes a table in which the mean for the variable SCORE is compared to a hypothesized population mean, Mu<sub>0<\/sub>. Unless another value is specified, SAS sets this hypothesized mean to 0.\u00a0 In the SAS table shown above, entitled <strong>Tests\u00a0for\u00a0Location:\u00a0Mu0=0<\/strong>, the comparison of the sample mean ([latex]{\\overline{x}}[\/latex]) to the hypothesized population mean ([latex]{\\mu}[\/latex]) is evaluated with the Student\u2019s t-test.<\/p>\n<p>The results presented in the table above show that the Student\u2019s t-statistic is 16.91 and the probability associated with this estimate is &lt;0.0001. 
Together these values indicate that the observed sample mean is significantly different from the hypothesized mean for the population (set at Mu<sub>0<\/sub>=0) from which the sample was drawn.<\/p>\n<p>However, what if we wanted to establish a suggested value for the population mean that is not 0, but that is based on a value reported in the literature?\u00a0 In this case, we could assign a suggested value to the population mean and then compare the observed mean for the sample to the expected value for the population.\u00a0 In the following code, we test this notion.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Assign a suggested value to the population mean<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>PROC TTEST DATA=MN_HR H0=54<br \/>\nPLOTS(SHOWH0)<br \/>\nALPHA=0.05;<br \/>\nVAR SCORE;<br \/>\nRUN;<\/p>\n<\/div>\n<\/div>\n<p>The SAS output is given below. The results indicate that the average score for the sample ([latex]{\\overline{x}}[\/latex] = 61.80) is not significantly different from the expected score ([latex]{\\mu}[\/latex] = 54) at the 0.05 level of significance (p = 0.0615). 
Notice that, in addition to the table of output, SAS also includes a graph illustrating the shape of the distribution and the comparison of the sample estimate to the expected population estimate of centrality.<\/p>\n<div style=\"margin: auto;\">\n<table>\n<thead>\n<tr>\n<td><strong>The t-test Procedure<\/strong><\/td>\n<\/tr>\n<tr>\n<td><strong>DF<\/strong><\/td>\n<td><strong>t\u00a0Value<\/strong><\/td>\n<td><strong>Pr\u00a0&gt;\u00a0|t|<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>9<\/td>\n<td>2.13<\/td>\n<td>0.0615<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div style=\"text-align: center; margin: auto;\">\n<table>\n<thead>\n<tr>\n<td><strong>Parameter estimates<\/strong><\/td>\n<td><\/td>\n<td><\/td>\n<\/tr>\n<tr class=\"shaded\">\n<td style=\"text-align: center\"><strong>Mean<\/strong><\/td>\n<td style=\"text-align: center\"><strong>95% CL Mean<\/strong><\/td>\n<td style=\"text-align: center\"><\/td>\n<\/tr>\n<tr>\n<td>61.8000<\/td>\n<td>Lower limit: 53.5343<\/td>\n<td>Upper limit: 70.0657<\/td>\n<\/tr>\n<\/thead>\n<\/table>\n<p>Because the 95% confidence interval for the sample mean (53.53 to 70.07) includes the value of the population mean, which we set a priori as 54, no significant difference is observed between the expected value and the observed value. 
This estimate is illustrated in the following graph.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/04\/ttest1-300x225.png\" alt=\"\" class=\"aligncenter wp-image-843\" width=\"479\" height=\"359\" srcset=\"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/04\/ttest1-300x225.png 300w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/04\/ttest1-768x576.png 768w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/04\/ttest1-65x49.png 65w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/04\/ttest1-225x169.png 225w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/04\/ttest1-350x263.png 350w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/04\/ttest1.png 1000w\" sizes=\"auto, (max-width: 479px) 100vw, 479px\" \/><\/p>\n<hr \/>\n<h2 style=\"text-align: left\">Calculate the Mean for A Frequency Distribution<\/h2>\n<p>In the following example, we compute the mean for frequency distribution. The formula to compute the mean of a frequency distribution is shown here as:<\/p>\n<p>[latex]{\\overline{x}} = {\\Sigma{f<span>x_i}<\/span>\\over{n}}[\/latex]<\/p>\n<p>Where:<\/p>\n<ul>\n<li>f refers to the frequency in each interval<\/li>\n<li>xi refers to the mid-point of the interval<\/li>\n<li>i refers to the \u201cith\u201d case within the distribution<\/li>\n<li>n refers to all of the cases within the distribution.<\/li>\n<\/ul>\n<p>Below is the frequency distribution table for the heights of 200 individuals. The data represent heights recorded in centimetres and organized into seven categories. The SAS code to compute the mean for this set of data is shown below the table. 
Notice that, for the SAS program, the table is reduced to two variables: GRPMDPT, the mid-point of each category, and COUNTS, the number of individuals whose heights fall within that category.<\/p>\n<table>\n<tbody>\n<tr>\n<td>Column 1<br \/>\ncell boundaries<\/td>\n<td>Column 2<br \/>\nfrequency (f)<\/td>\n<td>Column 3<br \/>\ncell mid-point<\/td>\n<td>Column 4<br \/>\n(f) x cell mid-point<\/td>\n<td>Column 5<br \/>\n(col 4 \u00f7 n)<\/td>\n<\/tr>\n<tr>\n<td>158.5 \u2013 161.5<\/td>\n<td>4<\/td>\n<td>160<\/td>\n<td>4 x 160 = 640<\/td>\n<td>640\/200 = 3.20<\/td>\n<\/tr>\n<tr>\n<td>161.5 \u2013 164.5<\/td>\n<td>12<\/td>\n<td>163<\/td>\n<td>12 x 163 = 1956<\/td>\n<td>1956\/200 = 9.78<\/td>\n<\/tr>\n<tr>\n<td>164.5 \u2013 167.5<\/td>\n<td>44<\/td>\n<td>166<\/td>\n<td>44 x 166 = 7304<\/td>\n<td>7304\/200 = 36.52<\/td>\n<\/tr>\n<tr>\n<td>167.5 \u2013 170.5<\/td>\n<td>64<\/td>\n<td>169<\/td>\n<td>64 x 169 = 10816<\/td>\n<td>10816\/200 = 54.08<\/td>\n<\/tr>\n<tr>\n<td>170.5 \u2013 173.5<\/td>\n<td>56<\/td>\n<td>172<\/td>\n<td>56 x 172 = 9632<\/td>\n<td>9632\/200 = 48.16<\/td>\n<\/tr>\n<tr>\n<td>173.5 \u2013 176.5<\/td>\n<td>16<\/td>\n<td>175<\/td>\n<td>16 x 175 = 2800<\/td>\n<td>2800\/200 = 14.00<\/td>\n<\/tr>\n<tr>\n<td>176.5 \u2013 179.5<\/td>\n<td>4<\/td>\n<td>178<\/td>\n<td>4 x 178 = 712<\/td>\n<td>712\/200 = 3.56<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td>[latex]{\\overline{x}} = {\\Sigma{(f x_i)}\\over{n}}[\/latex]<\/td>\n<td>[latex]{\\overline{x}} = {33860\\over 200}[\/latex]<\/td>\n<td>\u00a0= 169.3<\/td>\n<td>The mean [latex]{\\overline{x}}[\/latex] is also the sum of column 5<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">The SAS code to compute the mean for data in the table above<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>DATA FREQMN;<br \/>\nINPUT GRPMDPT COUNTS @@;<br 
\/>\nCRSPRDCT= GRPMDPT*COUNTS;<br \/>\n\/* COMPUTE RATIO FOR THE CROSS PRODUCT USING GROUP MIDPOINT X CELL FREQUENCY *\/<br \/>\nXP_RATIO=CRSPRDCT\/200;<br \/>\nLABEL GRPMDPT = 'GROUP MIDPOINT'<br \/>\nCOUNTS = 'NUMBER OF CASES PER CELL'<br \/>\nCRSPRDCT = 'CROSS PRODUCT PER CELL'<br \/>\nXP_RATIO = 'CROSS PRODUCT RATIO';<br \/>\nDATALINES;<br \/>\n160 4 163 12 166 44 169 64 172 56 175 16 178 4<br \/>\n;<br \/>\nPROC PRINT DATA=FREQMN;<br \/>\nVAR GRPMDPT COUNTS CRSPRDCT XP_RATIO;<br \/>\nSUM CRSPRDCT XP_RATIO;<br \/>\nFOOTNOTE1 \"* THE MEAN IS PRODUCED AS THE SUM OF THE VARIABLE XP_RATIO\";<br \/>\nFOOTNOTE2 \"** THE MEAN CAN ALSO BE CALCULATED FROM THE SUM OF THE VARIABLE CRSPRDCT \u00f7 200\";<br \/>\nRUN;<\/p>\n<\/div>\n<\/div>\n<p>The output generated by the SAS program above lists the raw data in column form, together with the column sums used to compute the mean for the frequency distribution.<\/p>\n<div style=\"margin: auto;\">\n<table>\n<thead>\n<tr>\n<td><strong>Obs<\/strong><\/td>\n<td><strong>grpmdpt<\/strong><\/td>\n<td><strong>counts<\/strong><\/td>\n<td><strong>crsprdct<\/strong><\/td>\n<td><strong>xp_ratio<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>1<\/strong><\/td>\n<td>160<\/td>\n<td>4<\/td>\n<td>640<\/td>\n<td>3.20<\/td>\n<\/tr>\n<tr>\n<td><strong>2<\/strong><\/td>\n<td>163<\/td>\n<td>12<\/td>\n<td>1956<\/td>\n<td>9.78<\/td>\n<\/tr>\n<tr>\n<td><strong>3<\/strong><\/td>\n<td>166<\/td>\n<td>44<\/td>\n<td>7304<\/td>\n<td>36.52<\/td>\n<\/tr>\n<tr>\n<td><strong>4<\/strong><\/td>\n<td>169<\/td>\n<td>64<\/td>\n<td>10816<\/td>\n<td>54.08<\/td>\n<\/tr>\n<tr>\n<td><strong>5<\/strong><\/td>\n<td>172<\/td>\n<td>56<\/td>\n<td>9632<\/td>\n<td>48.16<\/td>\n<\/tr>\n<tr>\n<td><strong>6<\/strong><\/td>\n<td>175<\/td>\n<td>16<\/td>\n<td>2800<\/td>\n<td>14.00<\/td>\n<\/tr>\n<tr>\n<td><strong>7<\/strong><\/td>\n<td>178<\/td>\n<td>4<\/td>\n<td>712<\/td>\n<td>3.56<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td><\/td>\n<t
d><\/td>\n<td><strong>33860<\/strong><\/td>\n<td><strong>169.30<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>* The mean is produced as the sum of the variable <strong><em>XP_RATIO<\/em><\/strong><\/p>\n<p>** The mean can also be calculated from the sum of the variable <strong><em>CRSPRDCT<\/em><\/strong> \u00f7 200<\/p>\n<hr \/>\n<h2 style=\"text-align: left\">The Weighted Mean Score<\/h2>\n<p>In some situations, we may wish to combine means from several samples. Under such circumstances, we need to consider the sample size (or weight) of the distribution from which each mean was drawn. Weighting each sample mean by the number of subjects in its sample ensures that each mean contributes to the combined mean in proportion to its sample size. The formula for the weighted mean of two samples with sizes n<sub>1<\/sub> and n<sub>2<\/sub> is shown here:<\/p>\n<p>[latex]{\\overline{x}}={n_1\\times{\\overline{x_1}}+n_2\\times{\\overline{x_2}}\\over{n_1 + n_2}}[\/latex]<\/p>\n<hr \/>\n<h2 style=\"text-align: left\">The Median Score<\/h2>\n<p>The median score is also a measure of central tendency, and it is defined as the middle score in a set of ordered scores. In the example below, we begin with a set of scores (an array) and sort the scores from lowest to highest. Then we identify the score in the middle of the ordered set, so that half of the scores lie above it and half lie below it.<\/p>\n<p>Example: Median<\/p>\n<p>The <em>median<\/em> is the middle score. 
Considering the heart rate values again, we put these readings in order of magnitude and then identify which value is in the middle:<\/p>\n<ul>\n<li>57<\/li>\n<li>59<\/li>\n<li>59<\/li>\n<li>75<\/li>\n<li>78<\/li>\n<li>78<\/li>\n<li>85<\/li>\n<li>88<\/li>\n<li>88<\/li>\n<li>88<\/li>\n<\/ul>\n<p>In this case, we have an even number of values (<em>n<\/em> = 10), so we calculate the average of the two values in the middle (the 5th and 6th). In this example both are 78, so the median is 78.<\/p>\n<p>With an odd number of scores (<em>n<\/em> = 9), the median is simply the middle score:<\/p>\n<ul>\n<li>initial array of scores: {12, 72, 56, 34, 35, 13, 36, 16, 67}<\/li>\n<li>sorted array of scores: {12, 13, 16, 34, 35, 36, 56, 67, 72}<\/li>\n<li>median (the 5th of the 9 sorted scores): 35<\/li>\n<\/ul>\n<p>Notice in the example above that, regardless of the actual scores, the middle score in the ordered set of scores is the median, which in this set is 35.<\/p>\n<p>When we have an even number of scores in our array, one extra step is needed to identify the median score in the distribution (set of scores). Because two scores share the middle position, we compute the average of those two middle scores and use that number as the median score. That is, we add the two middle scores together and divide by 2.<\/p>\n<ul>\n<li>initial array of scores: {22, 32, 86, 44, 25, 13, 16, 18, 47, 11}<\/li>\n<li>sorted array of scores: {11, 13, 16, 18, 22, 25, 32, 44, 47, 86}<\/li>\n<li>computed median for the array: (22 + 25) \u00f7 2 = 23.5<\/li>\n<\/ul>\n<hr \/>\n<h2>The Mode Score<\/h2>\n<p>The mode score is the third measure of central tendency, and it is defined as the most frequently occurring score in a set of scores. 
In the example below, we simply count the number of scores that are the same within a set of scores, within an array or within a distribution.<\/p>\n<p>Below are 10 resting heart rate values:<\/p>\n<p>78, 88, 57, 59, 75, 85, 88, 78, 59, 88<\/p>\n<p>The mode is 88 because it appears most often.<\/p>\n<p>In the following example of 16 scores, the number 2 occurs 3 times, but the number 27 occurs 4 times therefore we would identify 27 as the mode score.<\/p>\n<p><strong><span style=\"color: #ff6600\">2<\/span>, <span style=\"color: #ff6600\">2<\/span>, <span style=\"color: #ff6600\">2<\/span>,<\/strong> 5, 6, 14, 15, 23, 26, <strong><span style=\"color: #0000ff\">27,<\/span> <span style=\"color: #0000ff\">27<\/span>, <span style=\"color: #0000ff\">27<\/span>, <span style=\"color: #0000ff\">27<\/span><\/strong>, 28, 37, 41<\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<\/div>\n","protected":false},"author":56,"menu_order":1,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-412","chapter","type-chapter","status-publish","hentry"],"part":401,"_links":{"self":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters\/412","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/users\/56"}],"version-history":[{"count":48,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters\/412\/revisions"}],"predecessor-version":[{"id":2152,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters\/412\/revisions\/2152"}],"part":[{"href":"https:\/\/pressbooks.library.upei
.ca\/montelpare\/wp-json\/pressbooks\/v2\/parts\/401"}],"metadata":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters\/412\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/media?parent=412"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapter-type?post=412"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/contributor?post=412"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/license?post=412"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}