{"id":973,"date":"2020-06-02T07:49:50","date_gmt":"2020-06-02T11:49:50","guid":{"rendered":"http:\/\/pressbooks.library.upei.ca\/montelpare\/?post_type=chapter&#038;p=973"},"modified":"2020-08-30T08:49:53","modified_gmt":"2020-08-30T12:49:53","slug":"the-t-test-for-independent-sample-means-and-pooled-versus-unpooled-variance","status":"publish","type":"chapter","link":"https:\/\/pressbooks.library.upei.ca\/montelpare\/chapter\/the-t-test-for-independent-sample-means-and-pooled-versus-unpooled-variance\/","title":{"raw":"The t-test for Independent Sample Means and Pooled Versus Unpooled Variance","rendered":"The t-test for Independent Sample Means and Pooled Versus Unpooled Variance"},"content":{"raw":"<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Learner Outcomes<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nAfter reading this chapter you should be able to:\r\n<ol>\r\n \t<li>Compute the significance of the difference between two sample means when the sample variances are different<\/li>\r\n \t<li>Compute the t-test for independent sample means<\/li>\r\n \t<li>Compute the t-tests for pooled versus un-pooled variance.<\/li>\r\n \t<li>Write a SAS program to compute and identify the important elements of the output for the computation<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<h2>Applications of the t-test under different research scenarios<\/h2>\r\nIn statistics, the t-test has a simple approach despite that it uses a variety of error terms in the denominator as shown in Table 30.1, below. Depending on the research design, the error term will differ to ensure that the appropriate variance estimates within each of the samples are included in the analyses. The following equations demonstrate the different error terms related to the types of comparisons.\r\n\r\n<strong>Table 30.1 A Summary of t-test Formulae<\/strong>\r\n<table style=\"border-collapse: collapse;width: 100%;height: 380px\" border=\"0\">\r\n<thead>\r\n<tr class=\"shaded\" style=\"height: 15px\">\r\n<td style=\"width: 50%;text-align: center;height: 15px\">t-test descriptions<\/td>\r\n<td class=\"border\" style=\"width: 50%;text-align: center;height: 15px;vertical-align: middle\" scope=\"col\">Appropriate t-test formula<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr class=\"border\" style=\"height: 30px\">\r\n<td style=\"width: 50%;height: 30px\">Evaluation of the single sample mean versus the mean for a population<\/td>\r\n<td class=\"border\" style=\"width: 50%;height: 30px;text-align: center;vertical-align: middle\" scope=\"col\"><img src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t1.png\" alt=\"\" width=\"500\" height=\"106\" class=\"aligncenter size-full wp-image-2014\" \/><\/td>\r\n<\/tr>\r\n<tr class=\"border\" style=\"height: 151px\">\r\n<td style=\"width: 50%;height: 151px\">The pairwise t-test uses the average difference in the measure of interest, from the pre-test score to the post-test score, and then divided by the standard error of the average difference.\u00a0 The standard error of the average difference is computed by dividing the standard deviation of the average difference by the square root of the number of cases in the pairwise comparison.<\/td>\r\n<td class=\"border\" style=\"width: 50%;height: 151px;text-align: center;vertical-align: middle\" scope=\"col\"><img src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t2.png\" alt=\"\" width=\"450\" height=\"128\" class=\"aligncenter size-full wp-image-2013\" \/><\/td>\r\n<\/tr>\r\n<tr class=\"border\" style=\"height: 91px\">\r\n<td style=\"width: 50%;height: 91px\">To evaluate the significance of the difference between two mean scores (regardless of the size of \"n\" in each level of the independent variable) we might consider using a pooled t-test for independent variables.<\/td>\r\n<td class=\"border\" style=\"width: 50%;height: 91px;text-align: center;vertical-align: middle\" scope=\"col\"><img src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t3.png\" alt=\"\" width=\"452\" height=\"77\" class=\"aligncenter size-full wp-image-2010\" \/><\/td>\r\n<\/tr>\r\n<tr class=\"border\" style=\"height: 93px\">\r\n<td style=\"width: 50%;height: 93px\">To evaluate the significance of the difference between two mean scores (regardless of the sample size \"n\") when we consider un-pooled or unequal variances<\/td>\r\n<td class=\"border\" style=\"width: 50%;height: 93px;text-align: center;vertical-align: middle\" scope=\"col\"><img src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t4.png\" alt=\"\" width=\"360\" height=\"115\" class=\"aligncenter size-full wp-image-2011\" \/><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nIn Figure 30.1 below we see the two dimensions of variance that can contribute to the differences observed in a t-test calculation. As illustrated, not only does the t-test process the differences between means, but the difference is also influenced by the variability between members within each sample.\r\n<p style=\"text-align: center\"><img src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/varcomp1-300x225.png\" alt=\"\" class=\"aligncenter size-medium wp-image-978\" width=\"300\" height=\"225\" \/><\/p>\r\n<p style=\"text-align: center\">Figure 30.1 Comparison of Two Independent Means<\/p>\r\nNotice that the decision about which t-test formula to select is dependent on the research design created by the researcher. For example, as noted in table 30.1, to evaluate the significance of the difference between two mean scores (regardless of the size of \"n\" in each level of the independent variables) we could consider using a t-test for independent variables, as noted in Table 30.1 formula 3. However, if the number of participants in the samples being compared is equal then it is more appropriate to use Table 30.1 formula 4.\u00a0 In this latter case, given that the sample size between the groups was equal and it is expected that the variance within the two groups is similar. In this case, we can pool or combine the variance estimates as we assume that the two groups have homogenous variance within and between the two groups.\r\n<div class=\"textbox shaded\"><span style=\"color: #0000ff\">NB The term homoscedasticity is a term that indicates equality of variance between independent variables. The term is more often used to refer to the variance estimates for each independent variable in a statistical model, as in a multiple linear regression equation. Homoscedasticity suggests that the variables have the same variance.<\/span><\/div>\r\n<h6><strong>Scenario 30.1.1 Comparing 2 groups with unknown variance and different sample sizes<\/strong><\/h6>\r\n<div>\r\n\r\nConsider the following scenario in which there are two groups with different sample sizes (number of participants in each group) and the variance is unknown within each of the groups.\u00a0 In this situation, the analysis of data uses the t-test for two independent groups.\u00a0 We can use the formula for the t-test for independent groups (with unequal sample sizes) to compute the significance of the differences in the mean scores from group1 in which the sample size is 10 participants and the mean scores from group2 \u00a0where the sample size is comprised of 8 participants.\r\n\r\n<strong>Table 30.2 Comparing Responses for two groups with unequal sample sizes and unknown variance<\/strong>\r\n\r\nData for Group 1\r\n\r\n<\/div>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\">Participant ID<\/th>\r\n<td>001<\/td>\r\n<td>002<\/td>\r\n<td>003<\/td>\r\n<td>004<\/td>\r\n<td>005<\/td>\r\n<td>006<\/td>\r\n<td>007<\/td>\r\n<td>008<\/td>\r\n<td>009<\/td>\r\n<td>010<\/td>\r\n<\/tr>\r\n<tr>\r\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\">Score<\/th>\r\n<td>\u00a0234<\/td>\r\n<td>254<\/td>\r\n<td>\u00a0260<\/td>\r\n<td>268<\/td>\r\n<td>253<\/td>\r\n<td>270<\/td>\r\n<td>281<\/td>\r\n<td>287<\/td>\r\n<td>265<\/td>\r\n<td>255<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nData for Group 2\r\n<table style=\"height: 58px\">\r\n<tbody>\r\n<tr style=\"height: 29px\">\r\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle;height: 29px;width: 93.45px\" scope=\"col\">Participant ID<\/th>\r\n<td style=\"height: 29px;width: 25.45px\">001<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">002<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">003<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">004<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">005<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">006<\/td>\r\n<td style=\"height: 29px;width: 24.65px\">007<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">008<\/td>\r\n<\/tr>\r\n<tr style=\"height: 29px\">\r\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle;height: 29px;width: 93.45px\" scope=\"col\">Score<\/th>\r\n<td style=\"height: 29px;width: 25.45px\">304<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">235<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">212<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">198<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">273<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">289<\/td>\r\n<td style=\"height: 29px;width: 24.65px\">301<\/td>\r\n<td style=\"height: 29px;width: 26.25px\">209<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nThe degrees of freedom for the two-group t-test is df=n<sub>1<\/sub>+n<sub>2<\/sub>-2, and in this case df=10-8-2=16 so that the t critical value based on an <em>alpha level of 0.05<\/em> is 2.12.\r\n\r\nIn the following SAS code, we compute the difference between the means for the data in Table 30.2.\u00a0 Here we include a grouping variable so that we can distinguish the data for each group before computing the t-test.\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">T-test for the difference between means with unequal sample size<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<div>OPTIONS PAGESIZE=65 LINESIZE=80;\r\nDATA INDT_TST;\r\nINPUT\u00a0 ID GROUP SCORE @@;\r\nDATALINES;\r\n001 1 234\u00a0002 1 254\u00a0003 1 260\u00a0004 1 268\u00a0005 1 253\u00a0006 1 270\u00a0007 1 281\u00a0008 1 287\u00a0009 1 265\u00a0010 1 255\u00a0011 2 304\u00a0012 2 235\u00a0013 2 212\u00a0014 2 198\u00a0015 2 273\u00a0016 2 289\u00a0017 2 301\u00a0018 2 209\r\n;\r\nPROC SORT DATA=INDT_TST; BY GROUP;\r\nPROC TTEST; CLASS GROUP; VAR SCORE;\r\nRUN;<\/div>\r\n<\/div>\r\n<\/div>\r\nThe output for the PROC T-TEST procedure for this independent t-test analysis is shown below. The INDEPENDENT t-test Procedure.\r\n<table style=\"border-collapse: collapse;width: 100%\" border=\"0\">\r\n<tbody>\r\n<tr class=\"shaded\">\r\n<td style=\"width: 11.1111%;text-align: center\">GROUP<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">N<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">MEAN<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">STD<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">STD ERR<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">MINIMUM<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">MAXIMUM<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 11.1111%;text-align: center\">1<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">10<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">262.7<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">\u00a015.17<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">4.79<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">234.0<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">287.0<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 11.1111%;text-align: center\">2<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">8<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">252.6<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">44.02<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">15.56<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">198.0<\/td>\r\n<td style=\"width: 11.1111%;text-align: center\">304.0<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<table style=\"border-collapse: collapse;width: 57.5436%\" border=\"0\">\r\n<thead>\r\n<tr class=\"shaded\">\r\n<td style=\"width: 20%;text-align: center\">GROUP<\/td>\r\n<td style=\"width: 20%;text-align: center\">MEAN<\/td>\r\n<td style=\"width: 2.22717%;text-align: center\">STANDARD DEVIATION<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 20%;text-align: center\">Diff (1-2)<\/td>\r\n<td style=\"width: 20%;text-align: center\">10.08<\/td>\r\n<td style=\"width: 2.22717%;text-align: center\">31.26<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<div align=\"center\">\r\n<table style=\"width: 255px\">\r\n<thead>\r\n<tr class=\"shaded\">\r\n<td style=\"width: 55.85px;text-align: center\">group<\/td>\r\n<td style=\"width: 81.45px;text-align: center\">Method<\/td>\r\n<td style=\"width: 49.45px;text-align: center\">Mean<\/td>\r\n<td style=\"width: 66.25px;text-align: center\">Std\u00a0Dev<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 55.85px;text-align: center\">1<\/td>\r\n<td style=\"width: 81.45px;text-align: center\"><\/td>\r\n<td style=\"width: 49.45px;text-align: center\">262.7<\/td>\r\n<td style=\"width: 66.25px;text-align: center\">\u00a015.17<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 55.85px;text-align: center\">2<\/td>\r\n<td style=\"width: 81.45px;text-align: center\"><\/td>\r\n<td style=\"width: 49.45px;text-align: center\">252.6<\/td>\r\n<td style=\"width: 66.25px;text-align: center\">44.<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 55.85px;text-align: center\">Diff (1-2)<\/td>\r\n<td style=\"width: 81.45px;text-align: center\">Pooled<\/td>\r\n<td style=\"width: 49.45px;text-align: center\">10.0750<\/td>\r\n<td style=\"width: 66.25px;text-align: center\">31,26<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 55.85px;text-align: center\">Diff (1-2)<\/td>\r\n<td style=\"width: 81.45px;text-align: center\">Satterthwaite<\/td>\r\n<td style=\"width: 49.45px;text-align: center\">10.0750<\/td>\r\n<td style=\"width: 66.25px;text-align: center\">47.37<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div align=\"center\">\r\n<table>\r\n<thead>\r\n<tr class=\"shaded\">\r\n<td style=\"text-align: center\">Method<\/td>\r\n<td style=\"text-align: center\">Variances<\/td>\r\n<td style=\"text-align: center\">DF<\/td>\r\n<td style=\"text-align: center\">t\u00a0Value<\/td>\r\n<td style=\"text-align: center\">Pr\u00a0&gt;\u00a0|t|<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>Pooled<\/td>\r\n<td>Equal<\/td>\r\n<td>16<\/td>\r\n<td>0.68<\/td>\r\n<td>0.51<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Satterthwaite<\/td>\r\n<td>Unequal<\/td>\r\n<td>8.33<\/td>\r\n<td>0.62<\/td>\r\n<td>0.55<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nEquality of Variances\r\n\r\n<\/div>\r\n<div align=\"center\">\r\n<table class=\" aligncenter\">\r\n<thead>\r\n<tr class=\"shaded\">\r\n<td style=\"width: 56.65px;text-align: center\">Method<\/td>\r\n<td style=\"width: 55.05px;text-align: center\">Num\u00a0DF<\/td>\r\n<td style=\"width: 50.25px;text-align: center\">Den\u00a0DF<\/td>\r\n<td style=\"width: 48.65px;text-align: center\">F Value<\/td>\r\n<td style=\"width: 46.25px;text-align: center\">Pr\u00a0&gt;\u00a0F<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 56.65px\">Folded F<\/td>\r\n<td style=\"width: 55.05px;text-align: center\">7<\/td>\r\n<td style=\"width: 50.25px;text-align: center\">9<\/td>\r\n<td style=\"width: 48.65px;text-align: center\">8.42<\/td>\r\n<td style=\"width: 46.25px;text-align: center\">0.0049<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\nTo determine if the mean scores in each group were significantly different we typically compare the t-observed to the t critical values, generally available in a reference table. For example, the t critical value for \u03b1 = 0.05, where df=9 is 2.262 for a two-tailed test and 1.833 for a one-tailed test. \u00a0In the SAS output shown here, a t-critical is not given, however, the probability of achieving the t-value that was computed is reported and this is the indicator of significance. That is to say, when the Pr\u00a0&gt;\u00a0|t| is greater than 0.05, as it is in this instance, we would accept the null hypothesis that there is no difference between the mean scores in each group.\r\n\r\nAdditionally, in the SAS output, we observe a t-value for both a pooled variance estimate and for an un-pooled variance estimate, where the Satterthwaite t Value estimates the unequal\/un-pooled variance. As a general rule because the Folded F stat is a test of unequal variances, when the <strong>folded F statistic<\/strong> is large and the p-value is &lt;0.05, as shown in the SAS output above, then we refer to the Satterthwaite unequal variances estimate to determine the decision rule regarding the comparison of means via the t-test.\r\n\r\n<strong>30.2 On the importance of p-values\u00a0\u00a0\u00a0<\/strong>\r\n\r\nIn the following data set there were 2 groups of 15 individuals.\u00a0 A test was conducted and each individual produced a score.\u00a0 The means were then computed for the scores in each group and a t-test was used to determine if there was a significant difference between the means for each group.\u00a0 The null hypothesis was given as:\u00a0 H<sub>0<\/sub>: mean for group1 = mean for group2\r\n\r\n<strong>Data in Scenario 1:<\/strong>\r\n<div class=\"textbox shaded\">\r\n\r\n<strong>The data for each group is shown here using the format (id, group, score): <\/strong>\r\n\r\n<strong>001 01 12,<\/strong> 002 01 25, 003 01 26, 004 01 23, 005 01 14, 006 01 15, 007 01 17, 008 01 11, 009 01 18, 010 01 14, 021 01 25, 023 01 28, 025 01 26, 027 01 23, 029 01 24 011 02 15, 012 02 34, 013 02 39, 014 02 35, 015 02 34, 016 02 33, 017 02 15, 018 02 31, 019 02 13, 020 02 20, 022 02 16, 024 02 22, 026 02 27, 028 02 26, 030 02 25\r\n\r\n<\/div>\r\nThe results of the t-test computation using SAS are shown here:\r\n\r\n<strong>The UNIVARIATE Procedure \u2013 Data for the total group for Dependent Variable Score<\/strong>\r\n<div align=\"center\">\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>N<\/strong><\/td>\r\n<td style=\"width: 75.85px\">30<\/td>\r\n<td style=\"width: 126.25px\"><strong>Sum Weights<\/strong><\/td>\r\n<td style=\"width: 77.45px\">30<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Mean<\/strong><\/td>\r\n<td style=\"width: 75.85px\">22.8666667<\/td>\r\n<td style=\"width: 126.25px\"><strong>Sum Observations<\/strong><\/td>\r\n<td style=\"width: 77.45px\">686<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Std Deviation<\/strong><\/td>\r\n<td style=\"width: 75.85px\">7.7135498<\/td>\r\n<td style=\"width: 126.25px\"><strong>Variance<\/strong><\/td>\r\n<td style=\"width: 77.45px\">59.4988506<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Skewness<\/strong><\/td>\r\n<td style=\"width: 75.85px\">0.25907362<\/td>\r\n<td style=\"width: 126.25px\"><strong>Kurtosis<\/strong><\/td>\r\n<td style=\"width: 77.45px\">-0.8503857<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Uncorrected SS<\/strong><\/td>\r\n<td style=\"width: 75.85px\">17412<\/td>\r\n<td style=\"width: 126.25px\"><strong>Corrected SS<\/strong><\/td>\r\n<td style=\"width: 77.45px\">1725.46667<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 107.85px\"><strong>Coeff Variation<\/strong><\/td>\r\n<td style=\"width: 75.85px\">33.7327251<\/td>\r\n<td style=\"width: 126.25px\"><strong>Std Error Mean<\/strong><\/td>\r\n<td style=\"width: 77.45px\">1.40829508<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<strong>The t-TEST Procedure <\/strong>\r\n<div align=\"center\">\r\n<table>\r\n<thead>\r\n<tr>\r\n<td><strong>group<\/strong><\/td>\r\n<td><strong>N<\/strong><\/td>\r\n<td><strong>Mean<\/strong><\/td>\r\n<td><strong>Std\u00a0Dev<\/strong><\/td>\r\n<td><strong>Std\u00a0Err<\/strong><\/td>\r\n<td><strong>Minimum<\/strong><\/td>\r\n<td><strong>Maximum<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td><strong>1<\/strong><\/td>\r\n<td>15<\/td>\r\n<td>20.0667<\/td>\r\n<td>5.8244<\/td>\r\n<td>1.5039<\/td>\r\n<td>11.0000<\/td>\r\n<td>28.0000<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>2<\/strong><\/td>\r\n<td>15<\/td>\r\n<td>25.6667<\/td>\r\n<td>8.5161<\/td>\r\n<td>2.1988<\/td>\r\n<td>13.0000<\/td>\r\n<td>39.0000<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Diff (1-2)<\/strong><\/td>\r\n<td><\/td>\r\n<td>-5.6000<\/td>\r\n<td>7.2955<\/td>\r\n<td>2.6639<\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div align=\"center\">\r\n<table style=\"width: 541px\">\r\n<thead>\r\n<tr>\r\n<td style=\"width: 59.05px\"><strong>group<\/strong><\/td>\r\n<td style=\"width: 103.05px\"><strong>Method<\/strong><\/td>\r\n<td style=\"width: 60.65px\"><strong>Mean<\/strong><\/td>\r\n<td style=\"width: 85.45px\" colspan=\"2\"><strong>95% CL of the Mean<\/strong><\/td>\r\n<td style=\"width: 91.85px\"><strong>Std Dev<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 59.05px\"><strong>1<\/strong><\/td>\r\n<td style=\"width: 103.05px\"><\/td>\r\n<td style=\"width: 60.65px\">20.0667<\/td>\r\n<td style=\"width: 85.45px\">16.8412<\/td>\r\n<td style=\"width: 61.45px\">23.2921<\/td>\r\n<td style=\"width: 91.85px\">5.8244<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 59.05px\"><strong>2<\/strong><\/td>\r\n<td style=\"width: 103.05px\"><\/td>\r\n<td style=\"width: 60.65px\">25.6667<\/td>\r\n<td style=\"width: 85.45px\">20.9506<\/td>\r\n<td style=\"width: 61.45px\">30.3827<\/td>\r\n<td style=\"width: 91.85px\">8.5161<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 59.05px\"><strong>Diff (1-2)<\/strong><\/td>\r\n<td style=\"width: 103.05px\"><strong>Pooled<\/strong><\/td>\r\n<td style=\"width: 60.65px\">-5.6000<\/td>\r\n<td style=\"width: 85.45px\">-11.0568<\/td>\r\n<td style=\"width: 61.45px\">-0.1432<\/td>\r\n<td style=\"width: 91.85px\">7.2955<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 59.05px\"><strong>Diff (1-2)<\/strong><\/td>\r\n<td style=\"width: 103.05px\"><strong>Satterthwaite<\/strong><\/td>\r\n<td style=\"width: 60.65px\">-5.6000<\/td>\r\n<td style=\"width: 85.45px\">-11.0893<\/td>\r\n<td style=\"width: 61.45px\">-0.1107<\/td>\r\n<td style=\"width: 91.85px\"><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div align=\"center\">\r\n<table>\r\n<thead>\r\n<tr>\r\n<td><strong>Method<\/strong><\/td>\r\n<td><strong>Variances<\/strong><\/td>\r\n<td><strong>DF<\/strong><\/td>\r\n<td><strong>t\u00a0Value<\/strong><\/td>\r\n<td><strong>Pr\u00a0&gt;\u00a0|t|<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td><strong>Pooled<\/strong><\/td>\r\n<td>Equal<\/td>\r\n<td>28<\/td>\r\n<td>-2.10<\/td>\r\n<td>0.0447<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Satterthwaite<\/strong><\/td>\r\n<td>Unequal<\/td>\r\n<td>24.746<\/td>\r\n<td>-2.10<\/td>\r\n<td>0.0459<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>Equality of Variances<\/strong>\r\n\r\n<\/div>\r\n<div align=\"center\">\r\n<table>\r\n<thead>\r\n<tr>\r\n<td style=\"width: 57.45px\"><strong>Method<\/strong><\/td>\r\n<td style=\"width: 55.85px\"><strong>Num\u00a0DF<\/strong><\/td>\r\n<td style=\"width: 51.05px\"><strong>Den\u00a0DF<\/strong><\/td>\r\n<td style=\"width: 50.25px\"><strong>F Value<\/strong><\/td>\r\n<td style=\"width: 40.65px\"><strong>Pr\u00a0&gt;\u00a0F<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 57.45px\"><strong>Folded F<\/strong><\/td>\r\n<td style=\"width: 55.85px\">14<\/td>\r\n<td style=\"width: 51.05px\">14<\/td>\r\n<td style=\"width: 50.25px\">2.14<\/td>\r\n<td style=\"width: 40.65px\">0.1675<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<span style=\"text-align: initial;font-size: 1em\">So for this output, we would <\/span><strong style=\"text-align: initial;font-size: 1em\">reject<\/strong><span style=\"text-align: initial;font-size: 1em\"> the null hypothesis and suggest that the mean for <\/span><em style=\"text-align: initial;font-size: 1em\">Group 1<\/em><span style=\"text-align: initial;font-size: 1em\"> was significantly different than the mean for <\/span><em style=\"text-align: initial;font-size: 1em\">Group 2<\/em><span style=\"text-align: initial;font-size: 1em\">. <\/span><span style=\"text-align: initial;font-size: 1em\">However, what would happen if in one of the groups we changed one of the scores by 5 points.\u00a0 Notice in the following data set for Scenario 2, all of the scores are exactly the same, except that we changed the data for participant 1 from 12 to 17.<\/span>\r\n\r\n<\/div>\r\n<strong>Data in Scenario 2:<\/strong>\r\n<div class=\"textbox shaded\">\r\n\r\n<strong>The data for each group are shown here using the format (id, group, score):<\/strong>\r\n\r\n<strong>001 01 17,<\/strong> 002 01 25, 003 01 26, 004 01 23, 005 01 14, 006 01 15, 007 01 17, 008 01 11, 009 01 18, 010 01 14, 021 01 25, 023 01 28, 025 01 26, 027 01 23, 029 01 24, 011 02 15, 012 02 34, 013 02 39, 014 02 35, 015 02 34, 016 02 33, 017 02 15, 018 02 31, 019 02 13, 020 02 20, 022 02 16, 024 02 22, 026 02 27, 028 02 26, 030 02 25\r\n\r\n<\/div>\r\nThe t-test output for Scenario 2 uses the exact same data set, except that the score for participant 1 in Group 1 was changed from a score of 12 to a score of 17. Notice the highlighted t values and the highlighted confidence intervals.\r\n<div align=\"center\">\r\n<table style=\"width: 476px\">\r\n<thead>\r\n<tr class=\"shaded\">\r\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>group<\/strong><\/th>\r\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Method<\/strong><\/th>\r\n<td style=\"width: 58.25px;text-align: center\"><strong>Mean<\/strong><\/td>\r\n<td style=\"width: 98.25px;text-align: center\" colspan=\"2\"><strong>95% Confidence Limits for the <\/strong><strong>Mean<\/strong><\/td>\r\n<td style=\"width: 41.45px;text-align: center\"><strong>\u00a0Std Dev<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>1<\/strong><\/th>\r\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>\u00a0<\/strong><\/th>\r\n<td style=\"width: 58.25px\">20.4000<\/td>\r\n<td style=\"width: 98.25px\">17.3755<\/td>\r\n<td style=\"width: 54.25px\">23.4245<\/td>\r\n<td style=\"width: 41.45px\">5.4616<\/td>\r\n<\/tr>\r\n<tr>\r\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>2<\/strong><\/th>\r\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>\u00a0<\/strong><\/th>\r\n<td style=\"width: 58.25px\">25.6667<\/td>\r\n<td style=\"width: 98.25px\">20.9506<\/td>\r\n<td style=\"width: 54.25px\">30.3827<\/td>\r\n<td style=\"width: 41.45px\">8.5161<\/td>\r\n<\/tr>\r\n<tr>\r\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Diff (1-2)<\/strong><\/th>\r\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Pooled<\/strong><\/th>\r\n<td style=\"width: 58.25px\">-5.2667<\/td>\r\n<td style=\"width: 98.25px\">-10.6175<\/td>\r\n<td style=\"width: 54.25px\">0.0841<\/td>\r\n<td style=\"width: 41.45px\">7.1538<\/td>\r\n<\/tr>\r\n<tr>\r\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Diff (1-2)<\/strong><\/th>\r\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Satterthwaite<\/strong><\/th>\r\n<td style=\"width: 58.25px\">-5.2667<\/td>\r\n<td style=\"width: 98.25px\">-10.6597<\/td>\r\n<td style=\"width: 54.25px\">0.1264<\/td>\r\n<td style=\"width: 41.45px\"><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div align=\"center\">\r\n<table>\r\n<thead>\r\n<tr class=\"shaded\">\r\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\"><strong>Method<\/strong><\/th>\r\n<td style=\"text-align: center\"><strong>Variances<\/strong><\/td>\r\n<td style=\"text-align: center\"><strong>DF<\/strong><\/td>\r\n<td style=\"text-align: center\"><strong>t\u00a0Value<\/strong><\/td>\r\n<td style=\"text-align: center\"><strong>Pr\u00a0&gt;\u00a0|t|<\/strong><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\"><strong>Pooled<\/strong><\/th>\r\n<td>Equal<\/td>\r\n<td>28<\/td>\r\n<td>-2.02<\/td>\r\n<td>0.0535<\/td>\r\n<\/tr>\r\n<tr>\r\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\"><strong>Satterthwaite<\/strong><\/th>\r\n<td>Unequal<\/td>\r\n<td>23.85<\/td>\r\n<td>-2.02<\/td>\r\n<td>0.0552<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\nIn this second example analysis, we would <strong>accept<\/strong> the null hypothesis indicating that there was no difference between the means. This decision is based on the comparison of the p-value to the accepted demarcation point of p&lt;0.05.\r\n\r\nDespite that we have a demarcation point for the probability of the observed t-test, we need to consider the range of scores that we could have seen.\u00a0 The confidence interval provides such information for us.\u00a0 In the first example, the 95% confidence interval tells us that we are 95% confident that the difference between means could be somewhere between -11.09 and -0.11.\u00a0 However, in the second example the 95% confidence interval tells us that we are 95% confident that the difference between means could be somewhere between -10.66 and 0.126.\r\n\r\nSullivan and Feinn<a href=\"#_ftn2\">[2]<\/a> provide two quotes to support the need to look beyond the simple comparison of research findings to a p-value. The first quote is by Gene Glass who said,\u00a0<span style=\"color: #0000ff\"><strong><em>\u201c<\/em><em>Statistical significance is the least interesting thing about the\u00a0<\/em><em>results. You should\u00a0 describe the results in terms of measures\u00a0<\/em><em>of magnitude \u2013 not just, does a treatment affect people, but\u00a0 <\/em><em>how much does it affect them.<\/em><em>\u201d.\u00a0 <\/em><\/strong><\/span>\r\n\r\nThe second quote is by Jacob Cohen, who said,<em>\u00a0<strong><span style=\"color: #0000ff\">\u201c<\/span><\/strong><\/em><strong><span style=\"color: #0000ff\"><em>The primary product of a research inquiry is one or more\u00a0<\/em><em>measures of effect size, not P values.<\/em><em>\u201d. <\/em><\/span><\/strong>\r\n\r\nTwo important points of interest arise from this comparison.\u00a0 The first being that the intervals are not only similar in size but that they are similar in the bandwidth on the number line[latex]\\;\\rightarrow\\; [\/latex] lower limits being (-11.09 and -10.66) and upper limits being (-0.11 and 0.126).\u00a0 However, the second point of interest is that we only had to change one score from the entire set of 30 scores and only by 5 points in order to change from showing a significant difference to a non-significant difference.\r\n\r\nIf you consider the standard deviation for the scores in <em>Group 1<\/em> in each trial, you will notice that a 5-point change is less than the score by which we expect any score to vary from the mean.\u00a0 That is, in the two examples the standard deviation is 5.82 and 5.46, respectively. Therefore, changing a score by less than the computed standard deviation was sufficient to cause a decision to change from <strong>significant<\/strong> to <strong>not significant<\/strong>.\r\n\r\n<span style=\"color: #0000ff\">Consider that this was a study in which you invested millions of dollars.\u00a0 If you only relied on the p-value then you would be happy with scenario 1 (a significant difference was found) but you would be tremendously disappointed with scenario 2, and you would unnecessarily throw away valuable information.\u00a0 So a guiding principle may be that despite the reported value of p for any comparison, consider also the standard deviations and the standard errors along with the computed confidence intervals before reporting the findings.<\/span>\r\n\r\n<strong>30.3 Estimating the Effect Size<\/strong>\r\n\r\nWe can compute the effect size \u2013 where the effect size is defined as the magnitude of the difference between the two means <strong>when the difference is adjusted by the standard deviation for the mean of interest.<\/strong> The formula is simply the difference between the two means in a scenario that compares two groups divided by the standard deviation of the group of interest. So how do I establish the group of interest? The confusion in using this formula is often in which standard deviation to select. One way is to simply select one group to be the standard reference group and the other group to be the group of interest.\r\n\r\n<strong>The <em>Effect Size<\/em> formula: <\/strong>[latex]ES = {\\left(\\overline{x_{1}} - \\overline{x_{2}}\\right)\\over{s_{1}} }[\/latex]\r\n\r\n<span style=\"text-align: initial;font-size: 1em\">The <\/span><em style=\"text-align: initial;font-size: 1em\">effect size<\/em><span style=\"text-align: initial;font-size: 1em\"> formula is often interpreted using Cohen\u2019s criteria where an effect size score of 0.2 is considered as a small but noticeable effect, while an effect size score of 0.5 is considered to be a medium effect size, and an effect size score of 0.8 is considered to be a large effect size.<\/span>\r\n\r\nWe can also calculate the confidence interval for the difference between two means in any scenario where we compute the t-observed score. The formula to compute the confidence interval for a mean difference for two independent samples is shown here. The elements for this calculation are produced from the SAS output using PROC UNIVARIATE or PROC MEANS and substituted into the equation. The formula for confidence intervals in a t-test for independent samples is:\r\n\r\n[latex]\\left(\\overline{x_{1}} - \\overline{x_{2}}\\right) \\pm t_{0.05} \\times \\sqrt{\\frac{s^2_{1}}{n_{1}}+ \\frac{s^2_{2}}{n_{2}}} [\/latex].\r\n\r\nWhere the t<sub>0.05<\/sub> is the critical value for<strong> t<\/strong> for the degrees of freedom in the study. Considering that we have 50 cases in our sample our degrees of freedom value will be: df =\u00a0 (n<sub>1<\/sub>\u2013 1) + (n<sub>2<\/sub> \u2013 1) and the critical value of t<sub>0.05<\/sub> = 2.01\r\n\r\nThe decision rule concerning a confidence interval in a t-test for independent samples is to determine if the range of the confidence interval from the lowest value to the highest value includes 0. If 0 is included in the range then we accept the null hypothesis that the mean for group 1 = the mean for group 2.\r\n\r\n<strong>30.4 The degrees of freedom and critical values<\/strong>\r\n<ul>\r\n \t<li>The term <em>degrees of freedom,<\/em> represents the number of scores within a set of scores that are free to vary, and the number of scores that must be fixed, in order to compute a result.<\/li>\r\n<\/ul>\r\n<strong>\u00a0<\/strong><strong>For example: <\/strong>Consider that the average age for a group of five students is 22. Therefore, the mean of <strong>22<\/strong> is the outcome or the result. Now consider what each student's age must be in order to calculate an average age of 22 [latex] \\rightarrow\\;[\/latex] the outcome (<em>aka the result<\/em>).\r\n\r\n<strong><span style=\"color: #0000ff\"><em>In other words within the set of scores that we observed, what scores are required to make up the set of scores, in order to compute the outcome (result) that we observed?<\/em><\/span><\/strong>\r\n\r\n<em>Let\u2019s work through the concept with the following example:<\/em>\r\n\r\nIdentify a sample of five students and then decide to ask each student to report their age. <span style=\"font-size: 1em\">The first student tells you that she is <\/span><strong style=\"font-size: 1em\">28 <\/strong>years old, <span style=\"font-size: 1em\">but the second student said that he is <\/span><strong style=\"font-size: 1em\">12 <\/strong>years old! Student number 3 reports that he is <strong>129<\/strong> years old and the fourth student suggests that her age is a whopping <strong>-10<\/strong>! No doubt you are realizing that their ages are totally fictitious but they are what they are, and the outcome for the average age remains at 22. Your challenge is now to determine what the age of the fifth student is in order to ensure that the overall group age is 22. That is, the age of the student cannot be <strong>a free choice<\/strong> but must be <strong>a fixed age<\/strong> in order for you to calculate the mean age for the group equal to 22.\r\n\r\nIn this example, you have some known information. You began with the outcome as the mean age of the group equal to 22, and you also have a set of 4 age values that were reported for the group (28, 12, 129, and -10).\u00a0 Since you know the real average age of the five students is 22.\u00a0 You decide to play along with the group, and you realize that you can use simple arithmetic factoring to solve the unknown value of the age for the 5th student.\r\n<ul>\r\n \t<li style=\"list-style-type: none\"><\/li>\r\n<\/ul>\r\n[latex]\\overline{x} = {\\Sigma{\\left(x_{1}\\; +\\; x_{2}\\;+\\; ... \\;+\\; x_{n}\\right)}\\over{n} }[\/latex] [latex] = {\\Sigma{\\left(28\\; +\\; 12\\;+\\; 129 \\;+\\; (-10)\\;+\\; x_{5}\\right)}\\over{n} }[\/latex].\u00a0Work through the computation of the numerator, and then factor out the [latex]x_{5}[\/latex] term by multiplying each side by the denominator value of 5, and then subtracting 159 from both sides as shown here.\r\n<table style=\"border-collapse: collapse;width: 100%\" border=\"0\">\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 33.3333%\">[latex]22 = {159 + x_{5}\\over{5}}[\/latex]<\/td>\r\n<td style=\"width: 33.3333%\">[latex]22 \\times{5} =\u00a0 {\\bcancel{5} \\times \\left(159 +x_{5}\\right)\\over{\\bcancel{5}}}[\/latex]<\/td>\r\n<td style=\"width: 33.3333%\">[latex]110 = \\left(159 + x_{5}\\right)[\/latex]\r\n\r\n[latex]110 - 159 = \\left(x_{5}\\right)[\/latex]\r\n\r\n[latex]-49\u00a0 = \\left(x_{5}\\right)[\/latex]<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nSince you know the real average age of the five students is 22.\u00a0You determine that the age for Student #5 MUST BE\u00a0 (-49).\u00a0The degrees of freedom is a term that represents the number of scores within a set of scores that are free to vary, and the number of scores that must be fixed, in order to compute a result. In your data set, 4 of the ages were free to vary, but the age for Student #5 had to be fixed at (-49) in order to compute the group\u2019s known mean age of 22.\r\n\r\nA simple formula for degrees of freedom is then to consider that the degrees of freedom equal the number of scores that are free to vary minus the number of scores that are fixed within a set of scores.\r\n\r\n<strong>\u00a0<\/strong><strong>Why compute the DEGREES OF FREEDOM?<\/strong>\r\n\r\nThe degrees of freedom term is used to determine the critical value of a statistic given the research design and the sample size. In other words, the value from the probability distribution function for all possible scores of the statistic of interest under an estimate of the probability for a given research scenario.\u00a0 The statistic\u2019s critical score is related to the probability that the null hypothesis is true.\u00a0 In the case of using the t-test, the null hypothesis is a derivative of the <strong>mean observed<\/strong> being equal to the <strong>mean expected<\/strong>.\u00a0 The critical value can change for every application of a statistical computation because it depends on the size of each sample and the probability level set by the researcher to establish whether or not to accept or reject the null hypothesis (i.e. the level of significance).\r\n\r\nAll degrees of freedom computations can be derived from the following formula: <strong>df = (n \u2013 1).\u00a0 <\/strong>Below is a table of degrees of freedom formulae for different types of t-test designs. Notice that the formula differs to enable freedom of at least 1 measure within each array (set) of data.\r\n\r\n<strong>Table 30.3 Degrees of freedom computations for different t-test designs<\/strong>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td><strong>TERM and Null Hypothesis<\/strong><\/td>\r\n<td><strong>FORMULA<\/strong><\/td>\r\n<td><strong>EXAMPLE<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Student\u2019s t-test\r\n\r\nH0: <em>sample mean<\/em> = 0<\/td>\r\n<td>df = (n \u2013 1)<\/td>\r\n<td>Given a sample size of 10, degrees of freedom is: df=(10 \u2013 1), df = 9\r\n\r\n&nbsp;<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Pair-wise t-test\r\n\r\nH0: <em>preTestmean<\/em> = postTestmean<\/td>\r\n<td>df = (npairs \u2013 1)<\/td>\r\n<td>Given a sample size of 10 pairs, degrees of freedom is: df=(10 \u2013 1), df = 9\r\n\r\n&nbsp;<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>t-test for two group comparisons\u2014equal n in each group<a href=\"#_ftn1\">[1]<\/a>\r\n\r\nH0: <em>grp1mean<\/em> = <em>grp2<\/em>mean<\/td>\r\n<td>(n1 + n2) \u2013 2<\/td>\r\n<td>Given that the sample sizes are n1= 10 and n2=10, degrees of freedom is:\r\n\r\ndf=(10 + 10) \u2013 2, df = 18\r\n\r\n&nbsp;<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>t-test for two group comparisons\u2014unequal n in each group\r\n\r\nH0: <em>grp1mean<\/em> = <em>grp2<\/em>mean<\/td>\r\n<td>(n1\u2013 1) + (n2 \u2013 1)<\/td>\r\n<td>Given that the sample sizes are n1= 8 and n2=13, degrees of freedom is:\r\n\r\ndf=(8-1) + (13-1), df = 19\r\n\r\n&nbsp;<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<a href=\"#_ftnref1\" style=\"text-align: initial;font-size: 1em\">[1]<\/a><span style=\"text-align: initial;font-size: 1em\"> assuming equal variances within the two sample distributions<\/span>\r\n\r\n<a href=\"#_ftnref1\" style=\"text-align: initial;font-size: 1em\">[2]<\/a><span style=\"text-align: initial;font-size: 1em\"> Sullivan, G , and Feinn, R., Using effect size, or why the p Value isn\u2019t enough, Journal of Graduate Medical Education, September 2012, 279-282.<\/span>\r\n\r\n&nbsp;\r\n\r\n&nbsp;","rendered":"<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Learner Outcomes<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>After reading this chapter you should be able to:<\/p>\n<ol>\n<li>Compute the significance of the difference between two sample means when the sample variances are different<\/li>\n<li>Compute the t-test for independent sample means<\/li>\n<li>Compute the t-tests for pooled versus un-pooled variance.<\/li>\n<li>Write a SAS program to compute and identify the important elements of the output for the computation<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<h2>Applications of the t-test under different research scenarios<\/h2>\n<p>In statistics, the t-test has a simple approach despite that it uses a variety of error terms in the denominator as shown in Table 30.1, below. Depending on the research design, the error term will differ to ensure that the appropriate variance estimates within each of the samples are included in the analyses. The following equations demonstrate the different error terms related to the types of comparisons.<\/p>\n<p><strong>Table 30.1 A Summary of t-test Formulae<\/strong><\/p>\n<table style=\"border-collapse: collapse;width: 100%;height: 380px\">\n<thead>\n<tr class=\"shaded\" style=\"height: 15px\">\n<td style=\"width: 50%;text-align: center;height: 15px\">t-test descriptions<\/td>\n<td class=\"border\" style=\"width: 50%;text-align: center;height: 15px;vertical-align: middle\" scope=\"col\">Appropriate t-test formula<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"border\" style=\"height: 30px\">\n<td style=\"width: 50%;height: 30px\">Evaluation of the single sample mean versus the mean for a population<\/td>\n<td class=\"border\" style=\"width: 50%;height: 30px;text-align: center;vertical-align: middle\" scope=\"col\"><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t1.png\" alt=\"\" width=\"500\" height=\"106\" class=\"aligncenter size-full wp-image-2014\" srcset=\"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t1.png 500w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t1-300x64.png 300w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t1-65x14.png 65w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t1-225x48.png 225w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t1-350x74.png 350w\" sizes=\"auto, (max-width: 500px) 100vw, 500px\" \/><\/td>\n<\/tr>\n<tr class=\"border\" style=\"height: 151px\">\n<td style=\"width: 50%;height: 151px\">The pairwise t-test uses the average difference in the measure of interest, from the pre-test score to the post-test score, and then divided by the standard error of the average difference.\u00a0 The standard error of the average difference is computed by dividing the standard deviation of the average difference by the square root of the number of cases in the pairwise comparison.<\/td>\n<td class=\"border\" style=\"width: 50%;height: 151px;text-align: center;vertical-align: middle\" scope=\"col\"><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t2.png\" alt=\"\" width=\"450\" height=\"128\" class=\"aligncenter size-full wp-image-2013\" srcset=\"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t2.png 450w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t2-300x85.png 300w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t2-65x18.png 65w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t2-225x64.png 225w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t2-350x100.png 350w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/td>\n<\/tr>\n<tr class=\"border\" style=\"height: 91px\">\n<td style=\"width: 50%;height: 91px\">To evaluate the significance of the difference between two mean scores (regardless of the size of &#8220;n&#8221; in each level of the independent variable) we might consider using a pooled t-test for independent variables.<\/td>\n<td class=\"border\" style=\"width: 50%;height: 91px;text-align: center;vertical-align: middle\" scope=\"col\"><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t3.png\" alt=\"\" width=\"452\" height=\"77\" class=\"aligncenter size-full wp-image-2010\" srcset=\"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t3.png 452w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t3-300x51.png 300w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t3-65x11.png 65w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t3-225x38.png 225w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t3-350x60.png 350w\" sizes=\"auto, (max-width: 452px) 100vw, 452px\" \/><\/td>\n<\/tr>\n<tr class=\"border\" style=\"height: 93px\">\n<td style=\"width: 50%;height: 93px\">To evaluate the significance of the difference between two mean scores (regardless of the sample size &#8220;n&#8221;) when we consider un-pooled or unequal variances<\/td>\n<td class=\"border\" style=\"width: 50%;height: 93px;text-align: center;vertical-align: middle\" scope=\"col\"><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t4.png\" alt=\"\" width=\"360\" height=\"115\" class=\"aligncenter size-full wp-image-2011\" srcset=\"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t4.png 360w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t4-300x96.png 300w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t4-65x21.png 65w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t4-225x72.png 225w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/t4-350x112.png 350w\" sizes=\"auto, (max-width: 360px) 100vw, 360px\" \/><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>In Figure 30.1 below we see the two dimensions of variance that can contribute to the differences observed in a t-test calculation. As illustrated, not only does the t-test process the differences between means, but the difference is also influenced by the variability between members within each sample.<\/p>\n<p style=\"text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/varcomp1-300x225.png\" alt=\"\" class=\"aligncenter size-medium wp-image-978\" width=\"300\" height=\"225\" srcset=\"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/varcomp1-300x225.png 300w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/varcomp1-768x577.png 768w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/varcomp1-1024x769.png 1024w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/varcomp1-65x49.png 65w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/varcomp1-225x169.png 225w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/varcomp1-350x263.png 350w, https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-content\/uploads\/sites\/49\/2020\/06\/varcomp1.png 1200w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p style=\"text-align: center\">Figure 30.1 Comparison of Two Independent Means<\/p>\n<p>Notice that the decision about which t-test formula to select is dependent on the research design created by the researcher. For example, as noted in table 30.1, to evaluate the significance of the difference between two mean scores (regardless of the size of &#8220;n&#8221; in each level of the independent variables) we could consider using a t-test for independent variables, as noted in Table 30.1 formula 3. However, if the number of participants in the samples being compared is equal then it is more appropriate to use Table 30.1 formula 4.\u00a0 In this latter case, given that the sample size between the groups was equal and it is expected that the variance within the two groups is similar. In this case, we can pool or combine the variance estimates as we assume that the two groups have homogenous variance within and between the two groups.<\/p>\n<div class=\"textbox shaded\"><span style=\"color: #0000ff\">NB The term homoscedasticity is a term that indicates equality of variance between independent variables. The term is more often used to refer to the variance estimates for each independent variable in a statistical model, as in a multiple linear regression equation. Homoscedasticity suggests that the variables have the same variance.<\/span><\/div>\n<h6><strong>Scenario 30.1.1 Comparing 2 groups with unknown variance and different sample sizes<\/strong><\/h6>\n<div>\n<p>Consider the following scenario in which there are two groups with different sample sizes (number of participants in each group) and the variance is unknown within each of the groups.\u00a0 In this situation, the analysis of data uses the t-test for two independent groups.\u00a0 We can use the formula for the t-test for independent groups (with unequal sample sizes) to compute the significance of the differences in the mean scores from group1 in which the sample size is 10 participants and the mean scores from group2 \u00a0where the sample size is comprised of 8 participants.<\/p>\n<p><strong>Table 30.2 Comparing Responses for two groups with unequal sample sizes and unknown variance<\/strong><\/p>\n<p>Data for Group 1<\/p>\n<\/div>\n<table>\n<tbody>\n<tr>\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\">Participant ID<\/th>\n<td>001<\/td>\n<td>002<\/td>\n<td>003<\/td>\n<td>004<\/td>\n<td>005<\/td>\n<td>006<\/td>\n<td>007<\/td>\n<td>008<\/td>\n<td>009<\/td>\n<td>010<\/td>\n<\/tr>\n<tr>\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\">Score<\/th>\n<td>\u00a0234<\/td>\n<td>254<\/td>\n<td>\u00a0260<\/td>\n<td>268<\/td>\n<td>253<\/td>\n<td>270<\/td>\n<td>281<\/td>\n<td>287<\/td>\n<td>265<\/td>\n<td>255<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Data for Group 2<\/p>\n<table style=\"height: 58px\">\n<tbody>\n<tr style=\"height: 29px\">\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle;height: 29px;width: 93.45px\" scope=\"col\">Participant ID<\/th>\n<td style=\"height: 29px;width: 25.45px\">001<\/td>\n<td style=\"height: 29px;width: 26.25px\">002<\/td>\n<td style=\"height: 29px;width: 26.25px\">003<\/td>\n<td style=\"height: 29px;width: 26.25px\">004<\/td>\n<td style=\"height: 29px;width: 26.25px\">005<\/td>\n<td style=\"height: 29px;width: 26.25px\">006<\/td>\n<td style=\"height: 29px;width: 24.65px\">007<\/td>\n<td style=\"height: 29px;width: 26.25px\">008<\/td>\n<\/tr>\n<tr style=\"height: 29px\">\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle;height: 29px;width: 93.45px\" scope=\"col\">Score<\/th>\n<td style=\"height: 29px;width: 25.45px\">304<\/td>\n<td style=\"height: 29px;width: 26.25px\">235<\/td>\n<td style=\"height: 29px;width: 26.25px\">212<\/td>\n<td style=\"height: 29px;width: 26.25px\">198<\/td>\n<td style=\"height: 29px;width: 26.25px\">273<\/td>\n<td style=\"height: 29px;width: 26.25px\">289<\/td>\n<td style=\"height: 29px;width: 24.65px\">301<\/td>\n<td style=\"height: 29px;width: 26.25px\">209<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The degrees of freedom for the two-group t-test is df=n<sub>1<\/sub>+n<sub>2<\/sub>-2, and in this case df=10-8-2=16 so that the t critical value based on an <em>alpha level of 0.05<\/em> is 2.12.<\/p>\n<p>In the following SAS code, we compute the difference between the means for the data in Table 30.2.\u00a0 Here we include a grouping variable so that we can distinguish the data for each group before computing the t-test.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">T-test for the difference between means with unequal sample size<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<div>OPTIONS PAGESIZE=65 LINESIZE=80;<br \/>\nDATA INDT_TST;<br \/>\nINPUT\u00a0 ID GROUP SCORE @@;<br \/>\nDATALINES;<br \/>\n001 1 234\u00a0002 1 254\u00a0003 1 260\u00a0004 1 268\u00a0005 1 253\u00a0006 1 270\u00a0007 1 281\u00a0008 1 287\u00a0009 1 265\u00a0010 1 255\u00a0011 2 304\u00a0012 2 235\u00a0013 2 212\u00a0014 2 198\u00a0015 2 273\u00a0016 2 289\u00a0017 2 301\u00a0018 2 209<br \/>\n;<br \/>\nPROC SORT DATA=INDT_TST; BY GROUP;<br \/>\nPROC TTEST; CLASS GROUP; VAR SCORE;<br \/>\nRUN;<\/div>\n<\/div>\n<\/div>\n<p>The output for the PROC T-TEST procedure for this independent t-test analysis is shown below. The INDEPENDENT t-test Procedure.<\/p>\n<table style=\"border-collapse: collapse;width: 100%\">\n<tbody>\n<tr class=\"shaded\">\n<td style=\"width: 11.1111%;text-align: center\">GROUP<\/td>\n<td style=\"width: 11.1111%;text-align: center\">N<\/td>\n<td style=\"width: 11.1111%;text-align: center\">MEAN<\/td>\n<td style=\"width: 11.1111%;text-align: center\">STD<\/td>\n<td style=\"width: 11.1111%;text-align: center\">STD ERR<\/td>\n<td style=\"width: 11.1111%;text-align: center\">MINIMUM<\/td>\n<td style=\"width: 11.1111%;text-align: center\">MAXIMUM<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 11.1111%;text-align: center\">1<\/td>\n<td style=\"width: 11.1111%;text-align: center\">10<\/td>\n<td style=\"width: 11.1111%;text-align: center\">262.7<\/td>\n<td style=\"width: 11.1111%;text-align: center\">\u00a015.17<\/td>\n<td style=\"width: 11.1111%;text-align: center\">4.79<\/td>\n<td style=\"width: 11.1111%;text-align: center\">234.0<\/td>\n<td style=\"width: 11.1111%;text-align: center\">287.0<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 11.1111%;text-align: center\">2<\/td>\n<td style=\"width: 11.1111%;text-align: center\">8<\/td>\n<td style=\"width: 11.1111%;text-align: center\">252.6<\/td>\n<td style=\"width: 11.1111%;text-align: center\">44.02<\/td>\n<td style=\"width: 11.1111%;text-align: center\">15.56<\/td>\n<td style=\"width: 11.1111%;text-align: center\">198.0<\/td>\n<td style=\"width: 11.1111%;text-align: center\">304.0<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<table style=\"border-collapse: collapse;width: 57.5436%\">\n<thead>\n<tr class=\"shaded\">\n<td style=\"width: 20%;text-align: center\">GROUP<\/td>\n<td style=\"width: 20%;text-align: center\">MEAN<\/td>\n<td style=\"width: 2.22717%;text-align: center\">STANDARD DEVIATION<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"width: 20%;text-align: center\">Diff (1-2)<\/td>\n<td style=\"width: 20%;text-align: center\">10.08<\/td>\n<td style=\"width: 2.22717%;text-align: center\">31.26<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div style=\"margin: auto;\">\n<table style=\"width: 255px\">\n<thead>\n<tr class=\"shaded\">\n<td style=\"width: 55.85px;text-align: center\">group<\/td>\n<td style=\"width: 81.45px;text-align: center\">Method<\/td>\n<td style=\"width: 49.45px;text-align: center\">Mean<\/td>\n<td style=\"width: 66.25px;text-align: center\">Std\u00a0Dev<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"width: 55.85px;text-align: center\">1<\/td>\n<td style=\"width: 81.45px;text-align: center\"><\/td>\n<td style=\"width: 49.45px;text-align: center\">262.7<\/td>\n<td style=\"width: 66.25px;text-align: center\">\u00a015.17<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 55.85px;text-align: center\">2<\/td>\n<td style=\"width: 81.45px;text-align: center\"><\/td>\n<td style=\"width: 49.45px;text-align: center\">252.6<\/td>\n<td style=\"width: 66.25px;text-align: center\">44.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 55.85px;text-align: center\">Diff (1-2)<\/td>\n<td style=\"width: 81.45px;text-align: center\">Pooled<\/td>\n<td style=\"width: 49.45px;text-align: center\">10.0750<\/td>\n<td style=\"width: 66.25px;text-align: center\">31,26<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 55.85px;text-align: center\">Diff (1-2)<\/td>\n<td style=\"width: 81.45px;text-align: center\">Satterthwaite<\/td>\n<td style=\"width: 49.45px;text-align: center\">10.0750<\/td>\n<td style=\"width: 66.25px;text-align: center\">47.37<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div style=\"margin: auto;\">\n<table>\n<thead>\n<tr class=\"shaded\">\n<td style=\"text-align: center\">Method<\/td>\n<td style=\"text-align: center\">Variances<\/td>\n<td style=\"text-align: center\">DF<\/td>\n<td style=\"text-align: center\">t\u00a0Value<\/td>\n<td style=\"text-align: center\">Pr\u00a0&gt;\u00a0|t|<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Pooled<\/td>\n<td>Equal<\/td>\n<td>16<\/td>\n<td>0.68<\/td>\n<td>0.51<\/td>\n<\/tr>\n<tr>\n<td>Satterthwaite<\/td>\n<td>Unequal<\/td>\n<td>8.33<\/td>\n<td>0.62<\/td>\n<td>0.55<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Equality of Variances<\/p>\n<\/div>\n<div style=\"margin: auto;\">\n<table class=\"aligncenter\">\n<thead>\n<tr class=\"shaded\">\n<td style=\"width: 56.65px;text-align: center\">Method<\/td>\n<td style=\"width: 55.05px;text-align: center\">Num\u00a0DF<\/td>\n<td style=\"width: 50.25px;text-align: center\">Den\u00a0DF<\/td>\n<td style=\"width: 48.65px;text-align: center\">F Value<\/td>\n<td style=\"width: 46.25px;text-align: center\">Pr\u00a0&gt;\u00a0F<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"width: 56.65px\">Folded F<\/td>\n<td style=\"width: 55.05px;text-align: center\">7<\/td>\n<td style=\"width: 50.25px;text-align: center\">9<\/td>\n<td style=\"width: 48.65px;text-align: center\">8.42<\/td>\n<td style=\"width: 46.25px;text-align: center\">0.0049<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>To determine if the mean scores in each group were significantly different we typically compare the t-observed to the t critical values, generally available in a reference table. For example, the t critical value for \u03b1 = 0.05, where df=9 is 2.262 for a two-tailed test and 1.833 for a one-tailed test. \u00a0In the SAS output shown here, a t-critical is not given, however, the probability of achieving the t-value that was computed is reported and this is the indicator of significance. That is to say, when the Pr\u00a0&gt;\u00a0|t| is greater than 0.05, as it is in this instance, we would accept the null hypothesis that there is no difference between the mean scores in each group.<\/p>\n<p>Additionally, in the SAS output, we observe a t-value for both a pooled variance estimate and for an un-pooled variance estimate, where the Satterthwaite t Value estimates the unequal\/un-pooled variance. As a general rule because the Folded F stat is a test of unequal variances, when the <strong>folded F statistic<\/strong> is large and the p-value is &lt;0.05, as shown in the SAS output above, then we refer to the Satterthwaite unequal variances estimate to determine the decision rule regarding the comparison of means via the t-test.<\/p>\n<p><strong>30.2 On the importance of p-values\u00a0\u00a0\u00a0<\/strong><\/p>\n<p>In the following data set there were 2 groups of 15 individuals.\u00a0 A test was conducted and each individual produced a score.\u00a0 The means were then computed for the scores in each group and a t-test was used to determine if there was a significant difference between the means for each group.\u00a0 The null hypothesis was given as:\u00a0 H<sub>0<\/sub>: mean for group1 = mean for group2<\/p>\n<p><strong>Data in Scenario 1:<\/strong><\/p>\n<div class=\"textbox shaded\">\n<p><strong>The data for each group is shown here using the format (id, group, score): <\/strong><\/p>\n<p><strong>001 01 12,<\/strong> 002 01 25, 003 01 26, 004 01 23, 005 01 14, 006 01 15, 007 01 17, 008 01 11, 009 01 18, 010 01 14, 021 01 25, 023 01 28, 025 01 26, 027 01 23, 029 01 24 011 02 15, 012 02 34, 013 02 39, 014 02 35, 015 02 34, 016 02 33, 017 02 15, 018 02 31, 019 02 13, 020 02 20, 022 02 16, 024 02 22, 026 02 27, 028 02 26, 030 02 25<\/p>\n<\/div>\n<p>The results of the t-test computation using SAS are shown here:<\/p>\n<p><strong>The UNIVARIATE Procedure \u2013 Data for the total group for Dependent Variable Score<\/strong><\/p>\n<div style=\"margin: auto;\">\n<table>\n<tbody>\n<tr>\n<td style=\"width: 107.85px\"><strong>N<\/strong><\/td>\n<td style=\"width: 75.85px\">30<\/td>\n<td style=\"width: 126.25px\"><strong>Sum Weights<\/strong><\/td>\n<td style=\"width: 77.45px\">30<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Mean<\/strong><\/td>\n<td style=\"width: 75.85px\">22.8666667<\/td>\n<td style=\"width: 126.25px\"><strong>Sum Observations<\/strong><\/td>\n<td style=\"width: 77.45px\">686<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Std Deviation<\/strong><\/td>\n<td style=\"width: 75.85px\">7.7135498<\/td>\n<td style=\"width: 126.25px\"><strong>Variance<\/strong><\/td>\n<td style=\"width: 77.45px\">59.4988506<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Skewness<\/strong><\/td>\n<td style=\"width: 75.85px\">0.25907362<\/td>\n<td style=\"width: 126.25px\"><strong>Kurtosis<\/strong><\/td>\n<td style=\"width: 77.45px\">-0.8503857<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Uncorrected SS<\/strong><\/td>\n<td style=\"width: 75.85px\">17412<\/td>\n<td style=\"width: 126.25px\"><strong>Corrected SS<\/strong><\/td>\n<td style=\"width: 77.45px\">1725.46667<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 107.85px\"><strong>Coeff Variation<\/strong><\/td>\n<td style=\"width: 75.85px\">33.7327251<\/td>\n<td style=\"width: 126.25px\"><strong>Std Error Mean<\/strong><\/td>\n<td style=\"width: 77.45px\">1.40829508<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p><strong>The t-TEST Procedure <\/strong><\/p>\n<div style=\"margin: auto;\">\n<table>\n<thead>\n<tr>\n<td><strong>group<\/strong><\/td>\n<td><strong>N<\/strong><\/td>\n<td><strong>Mean<\/strong><\/td>\n<td><strong>Std\u00a0Dev<\/strong><\/td>\n<td><strong>Std\u00a0Err<\/strong><\/td>\n<td><strong>Minimum<\/strong><\/td>\n<td><strong>Maximum<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>1<\/strong><\/td>\n<td>15<\/td>\n<td>20.0667<\/td>\n<td>5.8244<\/td>\n<td>1.5039<\/td>\n<td>11.0000<\/td>\n<td>28.0000<\/td>\n<\/tr>\n<tr>\n<td><strong>2<\/strong><\/td>\n<td>15<\/td>\n<td>25.6667<\/td>\n<td>8.5161<\/td>\n<td>2.1988<\/td>\n<td>13.0000<\/td>\n<td>39.0000<\/td>\n<\/tr>\n<tr>\n<td><strong>Diff (1-2)<\/strong><\/td>\n<td><\/td>\n<td>-5.6000<\/td>\n<td>7.2955<\/td>\n<td>2.6639<\/td>\n<td><\/td>\n<td><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div style=\"margin: auto;\">\n<table style=\"width: 541px\">\n<thead>\n<tr>\n<td style=\"width: 59.05px\"><strong>group<\/strong><\/td>\n<td style=\"width: 103.05px\"><strong>Method<\/strong><\/td>\n<td style=\"width: 60.65px\"><strong>Mean<\/strong><\/td>\n<td style=\"width: 85.45px\" colspan=\"2\"><strong>95% CL of the Mean<\/strong><\/td>\n<td style=\"width: 91.85px\"><strong>Std Dev<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"width: 59.05px\"><strong>1<\/strong><\/td>\n<td style=\"width: 103.05px\"><\/td>\n<td style=\"width: 60.65px\">20.0667<\/td>\n<td style=\"width: 85.45px\">16.8412<\/td>\n<td style=\"width: 61.45px\">23.2921<\/td>\n<td style=\"width: 91.85px\">5.8244<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 59.05px\"><strong>2<\/strong><\/td>\n<td style=\"width: 103.05px\"><\/td>\n<td style=\"width: 60.65px\">25.6667<\/td>\n<td style=\"width: 85.45px\">20.9506<\/td>\n<td style=\"width: 61.45px\">30.3827<\/td>\n<td style=\"width: 91.85px\">8.5161<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 59.05px\"><strong>Diff (1-2)<\/strong><\/td>\n<td style=\"width: 103.05px\"><strong>Pooled<\/strong><\/td>\n<td style=\"width: 60.65px\">-5.6000<\/td>\n<td style=\"width: 85.45px\">-11.0568<\/td>\n<td style=\"width: 61.45px\">-0.1432<\/td>\n<td style=\"width: 91.85px\">7.2955<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 59.05px\"><strong>Diff (1-2)<\/strong><\/td>\n<td style=\"width: 103.05px\"><strong>Satterthwaite<\/strong><\/td>\n<td style=\"width: 60.65px\">-5.6000<\/td>\n<td style=\"width: 85.45px\">-11.0893<\/td>\n<td style=\"width: 61.45px\">-0.1107<\/td>\n<td style=\"width: 91.85px\"><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div style=\"margin: auto;\">\n<table>\n<thead>\n<tr>\n<td><strong>Method<\/strong><\/td>\n<td><strong>Variances<\/strong><\/td>\n<td><strong>DF<\/strong><\/td>\n<td><strong>t\u00a0Value<\/strong><\/td>\n<td><strong>Pr\u00a0&gt;\u00a0|t|<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Pooled<\/strong><\/td>\n<td>Equal<\/td>\n<td>28<\/td>\n<td>-2.10<\/td>\n<td>0.0447<\/td>\n<\/tr>\n<tr>\n<td><strong>Satterthwaite<\/strong><\/td>\n<td>Unequal<\/td>\n<td>24.746<\/td>\n<td>-2.10<\/td>\n<td>0.0459<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Equality of Variances<\/strong><\/p>\n<\/div>\n<div style=\"margin: auto;\">\n<table>\n<thead>\n<tr>\n<td style=\"width: 57.45px\"><strong>Method<\/strong><\/td>\n<td style=\"width: 55.85px\"><strong>Num\u00a0DF<\/strong><\/td>\n<td style=\"width: 51.05px\"><strong>Den\u00a0DF<\/strong><\/td>\n<td style=\"width: 50.25px\"><strong>F Value<\/strong><\/td>\n<td style=\"width: 40.65px\"><strong>Pr\u00a0&gt;\u00a0F<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"width: 57.45px\"><strong>Folded F<\/strong><\/td>\n<td style=\"width: 55.85px\">14<\/td>\n<td style=\"width: 51.05px\">14<\/td>\n<td style=\"width: 50.25px\">2.14<\/td>\n<td style=\"width: 40.65px\">0.1675<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><span style=\"text-align: initial;font-size: 1em\">So for this output, we would <\/span><strong style=\"text-align: initial;font-size: 1em\">reject<\/strong><span style=\"text-align: initial;font-size: 1em\"> the null hypothesis and suggest that the mean for <\/span><em style=\"text-align: initial;font-size: 1em\">Group 1<\/em><span style=\"text-align: initial;font-size: 1em\"> was significantly different than the mean for <\/span><em style=\"text-align: initial;font-size: 1em\">Group 2<\/em><span style=\"text-align: initial;font-size: 1em\">. <\/span><span style=\"text-align: initial;font-size: 1em\">However, what would happen if in one of the groups we changed one of the scores by 5 points.\u00a0 Notice in the following data set for Scenario 2, all of the scores are exactly the same, except that we changed the data for participant 1 from 12 to 17.<\/span><\/p>\n<\/div>\n<p><strong>Data in Scenario 2:<\/strong><\/p>\n<div class=\"textbox shaded\">\n<p><strong>The data for each group are shown here using the format (id, group, score):<\/strong><\/p>\n<p><strong>001 01 17,<\/strong> 002 01 25, 003 01 26, 004 01 23, 005 01 14, 006 01 15, 007 01 17, 008 01 11, 009 01 18, 010 01 14, 021 01 25, 023 01 28, 025 01 26, 027 01 23, 029 01 24, 011 02 15, 012 02 34, 013 02 39, 014 02 35, 015 02 34, 016 02 33, 017 02 15, 018 02 31, 019 02 13, 020 02 20, 022 02 16, 024 02 22, 026 02 27, 028 02 26, 030 02 25<\/p>\n<\/div>\n<p>The t-test output for Scenario 2 uses the exact same data set, except that the score for participant 1 in Group 1 was changed from a score of 12 to a score of 17. Notice the highlighted t values and the highlighted confidence intervals.<\/p>\n<div style=\"margin: auto;\">\n<table style=\"width: 476px\">\n<thead>\n<tr class=\"shaded\">\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>group<\/strong><\/th>\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Method<\/strong><\/th>\n<td style=\"width: 58.25px;text-align: center\"><strong>Mean<\/strong><\/td>\n<td style=\"width: 98.25px;text-align: center\" colspan=\"2\"><strong>95% Confidence Limits for the <\/strong><strong>Mean<\/strong><\/td>\n<td style=\"width: 41.45px;text-align: center\"><strong>\u00a0Std Dev<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>1<\/strong><\/th>\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>\u00a0<\/strong><\/th>\n<td style=\"width: 58.25px\">20.4000<\/td>\n<td style=\"width: 98.25px\">17.3755<\/td>\n<td style=\"width: 54.25px\">23.4245<\/td>\n<td style=\"width: 41.45px\">5.4616<\/td>\n<\/tr>\n<tr>\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>2<\/strong><\/th>\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>\u00a0<\/strong><\/th>\n<td style=\"width: 58.25px\">25.6667<\/td>\n<td style=\"width: 98.25px\">20.9506<\/td>\n<td style=\"width: 54.25px\">30.3827<\/td>\n<td style=\"width: 41.45px\">8.5161<\/td>\n<\/tr>\n<tr>\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Diff (1-2)<\/strong><\/th>\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Pooled<\/strong><\/th>\n<td style=\"width: 58.25px\">-5.2667<\/td>\n<td style=\"width: 98.25px\">-10.6175<\/td>\n<td style=\"width: 54.25px\">0.0841<\/td>\n<td style=\"width: 41.45px\">7.1538<\/td>\n<\/tr>\n<tr>\n<th class=\"shaded\" style=\"width: 47.05px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Diff (1-2)<\/strong><\/th>\n<th class=\"shaded\" style=\"width: 97.45px;text-align: center;vertical-align: middle\" scope=\"col\"><strong>Satterthwaite<\/strong><\/th>\n<td style=\"width: 58.25px\">-5.2667<\/td>\n<td style=\"width: 98.25px\">-10.6597<\/td>\n<td style=\"width: 54.25px\">0.1264<\/td>\n<td style=\"width: 41.45px\"><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div style=\"margin: auto;\">\n<table>\n<thead>\n<tr class=\"shaded\">\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\"><strong>Method<\/strong><\/th>\n<td style=\"text-align: center\"><strong>Variances<\/strong><\/td>\n<td style=\"text-align: center\"><strong>DF<\/strong><\/td>\n<td style=\"text-align: center\"><strong>t\u00a0Value<\/strong><\/td>\n<td style=\"text-align: center\"><strong>Pr\u00a0&gt;\u00a0|t|<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\"><strong>Pooled<\/strong><\/th>\n<td>Equal<\/td>\n<td>28<\/td>\n<td>-2.02<\/td>\n<td>0.0535<\/td>\n<\/tr>\n<tr>\n<th class=\"shaded\" style=\"text-align: center;vertical-align: middle\" scope=\"col\"><strong>Satterthwaite<\/strong><\/th>\n<td>Unequal<\/td>\n<td>23.85<\/td>\n<td>-2.02<\/td>\n<td>0.0552<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>In this second example analysis, we would <strong>accept<\/strong> the null hypothesis indicating that there was no difference between the means. This decision is based on the comparison of the p-value to the accepted demarcation point of p&lt;0.05.<\/p>\n<p>Despite that we have a demarcation point for the probability of the observed t-test, we need to consider the range of scores that we could have seen.\u00a0 The confidence interval provides such information for us.\u00a0 In the first example, the 95% confidence interval tells us that we are 95% confident that the difference between means could be somewhere between -11.09 and -0.11.\u00a0 However, in the second example the 95% confidence interval tells us that we are 95% confident that the difference between means could be somewhere between -10.66 and 0.126.<\/p>\n<p>Sullivan and Feinn<a href=\"#_ftn2\">[2]<\/a> provide two quotes to support the need to look beyond the simple comparison of research findings to a p-value. The first quote is by Gene Glass who said,\u00a0<span style=\"color: #0000ff\"><strong><em>\u201c<\/em><em>Statistical significance is the least interesting thing about the\u00a0<\/em><em>results. You should\u00a0 describe the results in terms of measures\u00a0<\/em><em>of magnitude \u2013 not just, does a treatment affect people, but\u00a0 <\/em><em>how much does it affect them.<\/em><em>\u201d.\u00a0 <\/em><\/strong><\/span><\/p>\n<p>The second quote is by Jacob Cohen, who said,<em>\u00a0<strong><span style=\"color: #0000ff\">\u201c<\/span><\/strong><\/em><strong><span style=\"color: #0000ff\"><em>The primary product of a research inquiry is one or more\u00a0<\/em><em>measures of effect size, not P values.<\/em><em>\u201d. <\/em><\/span><\/strong><\/p>\n<p>Two important points of interest arise from this comparison.\u00a0 The first being that the intervals are not only similar in size but that they are similar in the bandwidth on the number line[latex]\\;\\rightarrow\\;[\/latex] lower limits being (-11.09 and -10.66) and upper limits being (-0.11 and 0.126).\u00a0 However, the second point of interest is that we only had to change one score from the entire set of 30 scores and only by 5 points in order to change from showing a significant difference to a non-significant difference.<\/p>\n<p>If you consider the standard deviation for the scores in <em>Group 1<\/em> in each trial, you will notice that a 5-point change is less than the score by which we expect any score to vary from the mean.\u00a0 That is, in the two examples the standard deviation is 5.82 and 5.46, respectively. Therefore, changing a score by less than the computed standard deviation was sufficient to cause a decision to change from <strong>significant<\/strong> to <strong>not significant<\/strong>.<\/p>\n<p><span style=\"color: #0000ff\">Consider that this was a study in which you invested millions of dollars.\u00a0 If you only relied on the p-value then you would be happy with scenario 1 (a significant difference was found) but you would be tremendously disappointed with scenario 2, and you would unnecessarily throw away valuable information.\u00a0 So a guiding principle may be that despite the reported value of p for any comparison, consider also the standard deviations and the standard errors along with the computed confidence intervals before reporting the findings.<\/span><\/p>\n<p><strong>30.3 Estimating the Effect Size<\/strong><\/p>\n<p>We can compute the effect size \u2013 where the effect size is defined as the magnitude of the difference between the two means <strong>when the difference is adjusted by the standard deviation for the mean of interest.<\/strong> The formula is simply the difference between the two means in a scenario that compares two groups divided by the standard deviation of the group of interest. So how do I establish the group of interest? The confusion in using this formula is often in which standard deviation to select. One way is to simply select one group to be the standard reference group and the other group to be the group of interest.<\/p>\n<p><strong>The <em>Effect Size<\/em> formula: <\/strong>[latex]ES = {\\left(\\overline{x_{1}} - \\overline{x_{2}}\\right)\\over{s_{1}} }[\/latex]<\/p>\n<p><span style=\"text-align: initial;font-size: 1em\">The <\/span><em style=\"text-align: initial;font-size: 1em\">effect size<\/em><span style=\"text-align: initial;font-size: 1em\"> formula is often interpreted using Cohen\u2019s criteria where an effect size score of 0.2 is considered as a small but noticeable effect, while an effect size score of 0.5 is considered to be a medium effect size, and an effect size score of 0.8 is considered to be a large effect size.<\/span><\/p>\n<p>We can also calculate the confidence interval for the difference between two means in any scenario where we compute the t-observed score. The formula to compute the confidence interval for a mean difference for two independent samples is shown here. The elements for this calculation are produced from the SAS output using PROC UNIVARIATE or PROC MEANS and substituted into the equation. The formula for confidence intervals in a t-test for independent samples is:<\/p>\n<p>[latex]\\left(\\overline{x_{1}} - \\overline{x_{2}}\\right) \\pm t_{0.05} \\times \\sqrt{\\frac{s^2_{1}}{n_{1}}+ \\frac{s^2_{2}}{n_{2}}}[\/latex].<\/p>\n<p>Where the t<sub>0.05<\/sub> is the critical value for<strong> t<\/strong> for the degrees of freedom in the study. Considering that we have 50 cases in our sample our degrees of freedom value will be: df =\u00a0 (n<sub>1<\/sub>\u2013 1) + (n<sub>2<\/sub> \u2013 1) and the critical value of t<sub>0.05<\/sub> = 2.01<\/p>\n<p>The decision rule concerning a confidence interval in a t-test for independent samples is to determine if the range of the confidence interval from the lowest value to the highest value includes 0. If 0 is included in the range then we accept the null hypothesis that the mean for group 1 = the mean for group 2.<\/p>\n<p><strong>30.4 The degrees of freedom and critical values<\/strong><\/p>\n<ul>\n<li>The term <em>degrees of freedom,<\/em> represents the number of scores within a set of scores that are free to vary, and the number of scores that must be fixed, in order to compute a result.<\/li>\n<\/ul>\n<p><strong>\u00a0<\/strong><strong>For example: <\/strong>Consider that the average age for a group of five students is 22. Therefore, the mean of <strong>22<\/strong> is the outcome or the result. Now consider what each student&#8217;s age must be in order to calculate an average age of 22 [latex]\\rightarrow\\;[\/latex] the outcome (<em>aka the result<\/em>).<\/p>\n<p><strong><span style=\"color: #0000ff\"><em>In other words within the set of scores that we observed, what scores are required to make up the set of scores, in order to compute the outcome (result) that we observed?<\/em><\/span><\/strong><\/p>\n<p><em>Let\u2019s work through the concept with the following example:<\/em><\/p>\n<p>Identify a sample of five students and then decide to ask each student to report their age. <span style=\"font-size: 1em\">The first student tells you that she is <\/span><strong style=\"font-size: 1em\">28 <\/strong>years old, <span style=\"font-size: 1em\">but the second student said that he is <\/span><strong style=\"font-size: 1em\">12 <\/strong>years old! Student number 3 reports that he is <strong>129<\/strong> years old and the fourth student suggests that her age is a whopping <strong>-10<\/strong>! No doubt you are realizing that their ages are totally fictitious but they are what they are, and the outcome for the average age remains at 22. Your challenge is now to determine what the age of the fifth student is in order to ensure that the overall group age is 22. That is, the age of the student cannot be <strong>a free choice<\/strong> but must be <strong>a fixed age<\/strong> in order for you to calculate the mean age for the group equal to 22.<\/p>\n<p>In this example, you have some known information. You began with the outcome as the mean age of the group equal to 22, and you also have a set of 4 age values that were reported for the group (28, 12, 129, and -10).\u00a0 Since you know the real average age of the five students is 22.\u00a0 You decide to play along with the group, and you realize that you can use simple arithmetic factoring to solve the unknown value of the age for the 5th student.<\/p>\n<ul>\n<li style=\"list-style-type: none\"><\/li>\n<\/ul>\n<p>[latex]\\overline{x} = {\\Sigma{\\left(x_{1}\\; +\\; x_{2}\\;+\\; ... \\;+\\; x_{n}\\right)}\\over{n} }[\/latex] [latex]= {\\Sigma{\\left(28\\; +\\; 12\\;+\\; 129 \\;+\\; (-10)\\;+\\; x_{5}\\right)}\\over{n} }[\/latex].\u00a0Work through the computation of the numerator, and then factor out the [latex]x_{5}[\/latex] term by multiplying each side by the denominator value of 5, and then subtracting 159 from both sides as shown here.<\/p>\n<table style=\"border-collapse: collapse;width: 100%\">\n<tbody>\n<tr>\n<td style=\"width: 33.3333%\">[latex]22 = {159 + x_{5}\\over{5}}[\/latex]<\/td>\n<td style=\"width: 33.3333%\">[latex]22 \\times{5} =\u00a0 {\\bcancel{5} \\times \\left(159 +x_{5}\\right)\\over{\\bcancel{5}}}[\/latex]<\/td>\n<td style=\"width: 33.3333%\">[latex]110 = \\left(159 + x_{5}\\right)[\/latex]<\/p>\n<p>[latex]110 - 159 = \\left(x_{5}\\right)[\/latex]<\/p>\n<p>[latex]-49\u00a0 = \\left(x_{5}\\right)[\/latex]<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Since you know the real average age of the five students is 22.\u00a0You determine that the age for Student #5 MUST BE\u00a0 (-49).\u00a0The degrees of freedom is a term that represents the number of scores within a set of scores that are free to vary, and the number of scores that must be fixed, in order to compute a result. In your data set, 4 of the ages were free to vary, but the age for Student #5 had to be fixed at (-49) in order to compute the group\u2019s known mean age of 22.<\/p>\n<p>A simple formula for degrees of freedom is then to consider that the degrees of freedom equal the number of scores that are free to vary minus the number of scores that are fixed within a set of scores.<\/p>\n<p><strong>\u00a0<\/strong><strong>Why compute the DEGREES OF FREEDOM?<\/strong><\/p>\n<p>The degrees of freedom term is used to determine the critical value of a statistic given the research design and the sample size. In other words, the value from the probability distribution function for all possible scores of the statistic of interest under an estimate of the probability for a given research scenario.\u00a0 The statistic\u2019s critical score is related to the probability that the null hypothesis is true.\u00a0 In the case of using the t-test, the null hypothesis is a derivative of the <strong>mean observed<\/strong> being equal to the <strong>mean expected<\/strong>.\u00a0 The critical value can change for every application of a statistical computation because it depends on the size of each sample and the probability level set by the researcher to establish whether or not to accept or reject the null hypothesis (i.e. the level of significance).<\/p>\n<p>All degrees of freedom computations can be derived from the following formula: <strong>df = (n \u2013 1).\u00a0 <\/strong>Below is a table of degrees of freedom formulae for different types of t-test designs. Notice that the formula differs to enable freedom of at least 1 measure within each array (set) of data.<\/p>\n<p><strong>Table 30.3 Degrees of freedom computations for different t-test designs<\/strong><\/p>\n<table>\n<tbody>\n<tr>\n<td><strong>TERM and Null Hypothesis<\/strong><\/td>\n<td><strong>FORMULA<\/strong><\/td>\n<td><strong>EXAMPLE<\/strong><\/td>\n<\/tr>\n<tr>\n<td>Student\u2019s t-test<\/p>\n<p>H0: <em>sample mean<\/em> = 0<\/td>\n<td>df = (n \u2013 1)<\/td>\n<td>Given a sample size of 10, degrees of freedom is: df=(10 \u2013 1), df = 9<\/p>\n<p>&nbsp;<\/td>\n<\/tr>\n<tr>\n<td>Pair-wise t-test<\/p>\n<p>H0: <em>preTestmean<\/em> = postTestmean<\/td>\n<td>df = (npairs \u2013 1)<\/td>\n<td>Given a sample size of 10 pairs, degrees of freedom is: df=(10 \u2013 1), df = 9<\/p>\n<p>&nbsp;<\/td>\n<\/tr>\n<tr>\n<td>t-test for two group comparisons\u2014equal n in each group<a href=\"#_ftn1\">[1]<\/a><\/p>\n<p>H0: <em>grp1mean<\/em> = <em>grp2<\/em>mean<\/td>\n<td>(n1 + n2) \u2013 2<\/td>\n<td>Given that the sample sizes are n1= 10 and n2=10, degrees of freedom is:<\/p>\n<p>df=(10 + 10) \u2013 2, df = 18<\/p>\n<p>&nbsp;<\/td>\n<\/tr>\n<tr>\n<td>t-test for two group comparisons\u2014unequal n in each group<\/p>\n<p>H0: <em>grp1mean<\/em> = <em>grp2<\/em>mean<\/td>\n<td>(n1\u2013 1) + (n2 \u2013 1)<\/td>\n<td>Given that the sample sizes are n1= 8 and n2=13, degrees of freedom is:<\/p>\n<p>df=(8-1) + (13-1), df = 19<\/p>\n<p>&nbsp;<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><a href=\"#_ftnref1\" style=\"text-align: initial;font-size: 1em\">[1]<\/a><span style=\"text-align: initial;font-size: 1em\"> assuming equal variances within the two sample distributions<\/span><\/p>\n<p><a href=\"#_ftnref1\" style=\"text-align: initial;font-size: 1em\">[2]<\/a><span style=\"text-align: initial;font-size: 1em\"> Sullivan, G , and Feinn, R., Using effect size, or why the p Value isn\u2019t enough, Journal of Graduate Medical Education, September 2012, 279-282.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"author":56,"menu_order":4,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-973","chapter","type-chapter","status-publish","hentry"],"part":401,"_links":{"self":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters\/973","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/users\/56"}],"version-history":[{"count":50,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters\/973\/revisions"}],"predecessor-version":[{"id":975,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters\/973\/revisions\/975"}],"part":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/parts\/401"}],"metadata":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapters\/973\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/media?parent=973"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/pressbooks\/v2\/chapter-type?post=973"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/contributor?post=973"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/montelpare\/wp-json\/wp\/v2\/license?post=973"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}