{"id":50,"date":"2023-05-18T11:13:35","date_gmt":"2023-05-18T15:13:35","guid":{"rendered":"https:\/\/pressbooks.library.upei.ca\/bio3310\/?post_type=chapter&#038;p=50"},"modified":"2024-08-05T11:40:47","modified_gmt":"2024-08-05T15:40:47","slug":"part-4-basic-applied-statistics-using-minitab-21","status":"publish","type":"chapter","link":"https:\/\/pressbooks.library.upei.ca\/bio3310\/chapter\/part-4-basic-applied-statistics-using-minitab-21\/","title":{"raw":"Part 4: Basic Applied Statistics using Minitab 21","rendered":"Part 4: Basic Applied Statistics using Minitab 21"},"content":{"raw":"<div>\r\n<h1>Part 4: Basic Applied Statistics using Minitab 21<\/h1>\r\n<\/div>\r\n<div>\r\n\r\nA note about statistical significance:\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>All statistical tests are designed to test whether a pattern you see in your data is \u201cstatistically significant\u201d<\/strong>. We say something is statistically significant if our test confirms that the pattern is unlikely to have occurred by chance. To test this we look at the \u201cp-value\u201d\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>The p-value is related to the hypotheses about the data:<\/strong>\r\n<ul>\r\n \t<li><strong>H<\/strong><strong>0<\/strong><strong>\u00a0<\/strong>(Null Hypothesis) is that there is no difference between groups, or no relationship between variables. This is a \u201cno effect\u201d hypothesis<\/li>\r\n \t<li><strong>H<\/strong><strong>A<\/strong><strong>\u00a0<\/strong>(Alternate Hypothesis)\u00a0\u00a0 is that there is a difference or a relationship. This is sometimes called the \u201cactive\u201d hypothesis, since it indicates some sort of effect<\/li>\r\n<\/ul>\r\n<strong>Therefore, to interpret your data, you need to examine the graph of the data and clearly state an hypothesis, or you won\u2019t know what the p-value means!<\/strong>\r\n<table style=\"height: 66px;width: 391px\" width=\"374\">\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 674.533px\">\r\n<div>\r\n\r\n<strong>If p&lt;0.05, reject your Null Hypothesis<\/strong>\r\n\r\n<strong>If p&gt;0.05, accept your Null Hypothesis<\/strong>\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>For each of the tests detailed in the next pages, note what the<\/strong> <strong>Null Hypothesis is, so that you can determine how to interpret<\/strong> <strong>the p-value from the test.<\/strong>\r\n\r\n<\/div>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-1.png\" alt=\"\" width=\"341\" height=\"831\" class=\"alignnone size-full wp-image-383\" \/>\r\n<div>\r\n<h2>Getting started<strong>\u00a0<\/strong><\/h2>\r\n<strong>DATA<\/strong><strong> ENTRY<\/strong>\r\n\r\nWhen working in a spreadsheet, the common method of entering your data is in adjoining columns. For example, if you have data such as we looked at in class on the crab temperatures, you would put your crab data for time 1 in one column, and your crab data for time 2 in the next column. However, for advanced stats, you must enter your data so that all the responses for a single variable (in this case, temperature) are in a single column, with another column giving the key for the variable.\r\n\r\nNote that data in the \u201cadvanced stats\u201d format are set up so that the responses are given in the columns, and the variables (e.g. whether they ran or not, what their sex was\/is given as a category number in another column. This method of data entry is necessary for most statistical packages.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Notes<\/strong><strong> about the data examples<\/strong>\r\n\r\nFor this manual, most of the examples will be from a dataset on an imaginary group of students that were asked to take a bunch of measurements on themselves before and after running in place for one minute.\r\n\r\nOne group of students was asked to run in place for a minute, and another group (the control) did not run.\r\n\r\nStudents (in both groups) were asked to take their pulses (in heartbeats per minute) before and after the running exercise, then were asked to indicate whether they were male or female, whether they smoked or not, whether they thought of themselves as active or not, and so on. This data set allows us to illustrate a wide variety of statistical analyses. The dataset is at the back of this manual, and will be placed on your Moodle site, so that you can practice the exercises in this manual, and see what the answers should look like.\r\n<ul>\r\n \t<li>Ran = 1 means they ran,<\/li>\r\n \t<li>Ran = 2 are the nonrunners (control)<\/li>\r\n \t<li>Smokes = 1 means they smoke<\/li>\r\n \t<li>Sex = 1 means male<\/li>\r\n \t<li>Sex = 2 means female<\/li>\r\n \t<li>Activity levels: 1 = slight, 2 = moderate, 3 = high<\/li>\r\n<\/ul>\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div>\r\n\r\nTo start in Minitab:\r\n<ul>\r\n \t<li>Open Minitab 21\r\n<ul>\r\n \t<li>You will see a \u201csession\u201d window on top where you\u2019ll find the record of what you\u2019ve done, as well as the text results of statistical tests.<\/li>\r\n \t<li>The worksheet on the bottom will contain your data.<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<strong>Navigate in Minitab through the menus on the upper property toolbar.<\/strong>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-2.png\" alt=\"\" width=\"1300\" height=\"1028\" class=\"alignnone size-full wp-image-384\" \/>\r\n\r\n<strong>Entering data:<\/strong>\r\n\r\nData can be entered (typed in) directly, or copied from a spreadsheet\r\n<ul>\r\n \t<li>From spreadsheet: open your spreadsheet to the desired database and select and copy the data. <strong>O<\/strong><strong>nly copy the numbers<\/strong>... do not copy column headers (that will designate your columns as text columns, and cause problems when doing the statistical analyses)<\/li>\r\n \t<li>Reenter Minitab, paste the data into the worksheet.<\/li>\r\n \t<li>Name your columns by clicking on the blank space below the column number, and typing in the name<\/li>\r\n \t<li>Save your worksheet onto your data disk or personal drive. It will save as a .mpx file<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n\r\nSimple Column Statistics<strong>\u00a0<\/strong>\r\n\r\n<strong>Descriptive statistics:<\/strong>\r\n<ul>\r\n \t<li>Select <strong>Stat <\/strong>from the property bar, and click on \u201cbasic statistics\u201d.<\/li>\r\n \t<li>Select \u201c<strong>display descriptive statistics<\/strong>\u201d to see the Display window<\/li>\r\n \t<li>Click on the box labelled \u201cStatistics\u201d to see the range of statistics available.<\/li>\r\n \t<li>The list of variables is on the left and an empty box for the variable(s) to test is on the right.<\/li>\r\n<\/ul>\r\n<strong>Highlight the variable you want to test, and click select<\/strong>\r\n\r\nThere is a long list of potential statistics you can have the computer calculate, all in one operation. Check the boxes of all you would like.\r\n<ul>\r\n \t<li>Click <strong>ok <\/strong>to return to the display descriptive stats window<\/li>\r\n \t<li>Click <strong>ok <\/strong>again to obtain the data.<\/li>\r\n<\/ul>\r\nYour results will appear in the upper Minitab Session Window. You can copy and paste the results into your word processor spreadsheet if you want to.\r\n\r\n<\/div>\r\n<div>\r\n\r\n<strong>Variables<\/strong> <strong>to<\/strong> <strong>choose:<\/strong> Variables must be ordinal (such as Pulse 1, Pulse 2, Weight, Height in this example; Note that your categorical variables won\u2019t work here.\r\n\r\nThis section also allows you to see some basic graphs for your variables, by clicking on \u201cgraphs\u201d rather than\u201cstatistics\u201d\r\n<ul>\r\n \t<li>Click on all of these options to see how these graphs can help you get a feel for what your data looks like.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n<h3>Descriptive statistics for sub-groups within each column of data (e.g. males and females in your group; age groups, etc.)<\/h3>\r\nOptions:\r\n\r\na)\u00a0 cut and paste in your spreadsheet, then copy into Minitab and run analysis twice\r\n\r\nb)\u00a0 <strong>Split the data <\/strong>in Minitab (see page 46 for method) and run analysis twice\r\n\r\nc)\u00a0 use the \u201c<strong>By variables<\/strong><strong>\u201d<\/strong> option and run the analyses simultaneously.\r\n\r\ne.g. In the pulses dataset, each of the two pulse columns (Pulse 1, Pulse 2) include groups (runners &amp; non-runners, males and females, smokers and non-smokers). If you want to compare one subgroup to the other within a single column of data, you will need descriptive statistics and normality testing on each subgroup.\r\n\r\n<strong>Example: <\/strong>calculate the descriptive statistics for the runners and the non-runners separately, in the pulses2 column (the second pulse rate, measured after running).\r\n<ul>\r\n \t<li>Go into <strong>stat <\/strong>on the property bar, and click on \u201c<strong>basic statistics<\/strong>\u201d, then <strong>\u201cdisplay descriptive statistics<\/strong>\u201d. Select the Pulse2 column, and click select. Then place your cursor in the <strong>By variables <\/strong>window, and select the group variable, Ran<\/li>\r\n<\/ul>\r\nYour output (in the Session box) will have information for both subgroups in your variable (i.e. Ran 1 &amp; 2), and your graphs will also show both subgroups.\r\n\r\nYou can see how this lets you look at multiple subgroups separately without having to do a lot of cutting and pasting. you can choose this option in your \u201cstore descriptive statistics\u201d section as well.\r\n\r\n<\/div>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-3.png\" alt=\"\" width=\"1339\" height=\"441\" class=\"alignnone size-full wp-image-388\" \/>\r\n<div>\r\n<h3>Normality Testing<\/h3>\r\nMany statistical tests depend on data being parametric (one part of which is normality). Normality testing is a first step for most statistical testing.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\nThe normal distribution is a type of frequency distribution which has a characteristic bell curve shape with a particular height and width for its mean (average) and Standard Deviation. We can assess normality (by eye) by plotting the frequency distribution of the data, and comparing it to the normal curve that is calculated for a data set with this mean and standard deviation. However, it can be hard to see from the frequency plot so there are other methods to assess normality.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>Several methods to test normality: Generally use at least two, since not all work well for all data<\/strong>\r\n\r\na.\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 <strong><em>frequency histogram<\/em><\/strong><strong>: <\/strong>important to see what data look like, but not very accurate for assessing whether they fit the normal distribution\r\n\r\nb.\u00a0\u00a0\u00a0\u00a0\u00a0 <strong><em>normal probability plot<\/em><\/strong><strong>:<\/strong> This modifies the frequency scale, so that if data are normal, they fall on a straight line. <strong><em>**This is usually the best method to assess normality<\/em><\/strong>\r\n\r\nc.\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 <strong><em>determining whether the shape of the curve fits a mathematical range <\/em><\/strong><strong>(based on \u201cskew\u201d (tails) and \u201ckurtosis\u201d (height of curve)<\/strong>). This is a great method to do as a check on the other methods, in case you just aren\u2019t sure of the interpretation.\r\n\r\nd.\u00a0\u00a0\u00a0\u00a0\u00a0 <strong><em>Statistical methods: <\/em><\/strong>These give some comfort because they seem quantitative, but in fact, they are not accurate in many cases so should be used with caution. Several do not work well with small sample sizes, and several don\u2019t work well if there are many \u201ctied values\u201d (the same number repeated frequently in the dataset)\r\n<h3>Frequency Distributions<\/h3>\r\nFirst, assess the frequency distribution as a first step to see what the data look like.\r\n<ul>\r\n \t<li>Plot the histogram through the <strong>Column Statistics <\/strong>menu as described on p. 83, or through <strong>Graph, Histogram, <\/strong>as described on pp. 73-75.<\/li>\r\n \t<li>In the graphing menu, choose <strong>\u201cwith fit\u201d <\/strong>for the histogram with the normal curve.<\/li>\r\n \t<li>Select your variables as before, then click on \u201c<strong>dataview<\/strong>\u201d. Choose the \u201cdistribution tab, and make sure your distribution says \u201cnormal\u201d and click ok<\/li>\r\n<\/ul>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-4.png\" alt=\"\" width=\"907\" height=\"608\" class=\"alignnone size-full wp-image-389\" \/>\r\n\r\n<\/div>\r\n<div>\r\n\r\n<strong>Does your graph match the bell?<\/strong>\r\n<h3>The normal probability plot<\/h3>\r\n<ul>\r\n \t<li>Select <strong>Graph <\/strong>from the property bar, and then choose Probability plot from the drop-down menu. When prompted, choose the \u201csingle\u201d graph, and click okay<\/li>\r\n \t<li>Choose your variable as before. The normal probability plot is the default, but if you want to test another distribution (e.g. random), you can click on \u201cdistribution\u201d.<\/li>\r\n<\/ul>\r\nThis is produces a plot of your frequency histogram on a special probability scale, so that if the data are normal, your points should fall on a straight line. (Remember that this is for the entire column of data; if you need a subset of the data, such as males vs females, you\u2019ll need to separate them)\r\n\r\nYou can check this by eye, or you can do a statistical normality test on the data.\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-5.png\" alt=\"\" width=\"911\" height=\"607\" class=\"alignnone size-full wp-image-390\" \/>\r\n\r\n<\/div>\r\nThis plot includes the 95% confidence limit for the line. If the points generally fall along the line and are within the confidence limits lines, then you can assume normality\r\n<div><\/div>\r\n<strong>The conclusion from this <\/strong><strong>graph is that the data are normal. <\/strong>The dots deviate from the line very slightly, but not by much, and most fall within the confidence limits. We can run through other methods to see if they confirm our impression from the graph.\r\n<div>\r\n<h3>Using skew and kurtosis calculations<\/h3>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-6.png\" alt=\"\" width=\"1344\" height=\"427\" class=\"alignnone size-full wp-image-391\" \/><strong>Data are normally distributed if the standard error of the skew (SE<\/strong><strong>skew<\/strong><strong>)<\/strong> <strong>and the standard error of the kurtosis (SE<\/strong><strong>kurtosis<\/strong><strong>)<\/strong> <strong>fall between -1.96 and +1.96.<\/strong>\r\n\r\n<strong>Method: Determine the Standard Error (SE) of the kurtosis and skew from the skew and kurtosis values in the Descriptive stats analysis (see p. 84) <\/strong>(note that some statistical packages do this calculation for you). The equations below provide an approximation of the SE values.\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-7.png\" alt=\"\" width=\"794\" height=\"122\" class=\"alignnone size-full wp-image-392\" \/>\r\n<ul>\r\n \t<li>SEskew\u00a0= Skew \u00f7 \u221a (6\/n)<\/li>\r\n \t<li>SEkurtosis\u00a0= kurtosis \u00f7 \u221a (24\/n)<\/li>\r\n \t<li>n=no. of obs.<\/li>\r\n<\/ul>\r\nFor the Pulse 1 column:\r\n<ul>\r\n \t<li>SEskew\u00a0= skew \u00f7 \u221a(6\/n)\u00a0\u00a0 = .43 \u00f7 \u221a (6\/91) = 1.67<\/li>\r\n \t<li>SEkurt = kurtosis \u00f7 \u221a (24\/n\u00a0 = -.58 \u00f7 \u221a (24\/91) = -1.13<\/li>\r\n<\/ul>\r\n<strong>These both fall between 1.96 and -1.96, so data are statistically normal<\/strong>\r\n\r\n<strong>Note: the SE skew value is close to non-normal, as you can see by the bars in the figure looking a bit crowded towards the left side, but it still falls in the statistical range.<\/strong>\r\n\r\n<strong>\u00a0<\/strong><strong>This test confirms that the Pulse 1 data are normally distributed<\/strong>\r\n<ul>\r\n \t<li><strong>\u00a0Note: \u201cData is\u201d ??\u00a0\u00a0 \u201cData are\u201d ?? The word \u201cdata\u201d is plural (the singular version is \u201cdatum\u201d) so should always be given with the plural form of the verb when written.\r\n<\/strong><\/li>\r\n<\/ul>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Using Statistical Normality tests:<\/strong>\r\n\r\nOne big problem with statistical normality tests is that they are adversely affected by both small sample sizes and large numbers of \u201ctied values\u201d, i.e. when we have a number of duplicate values in our list of numbers. Tied values often occur if we have a large data set. Therefore the normality test must always be treated with caution, and results should ALWAYS be checked against the normal probability curve.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<h3><strong>\u00a0<\/strong>Using statistical normality tests<\/h3>\r\n<\/div>\r\n<div>\r\n<ul>\r\n \t<li>Minitab provides 3 statistical normality tests. All three will plot the \u201cnormal probability curve\u201d as part of the analysis so you can compare the test result to the graph.<\/li>\r\n \t<li>The purpose of the tests is to see whether the dots are significantly different from the line.<\/li>\r\n<\/ul>\r\n<strong>Method:<\/strong>\r\n<ul>\r\n \t<li>Choose <strong>Stat<\/strong>, Basic Statistics, then <strong>Normality Test <\/strong>(near the bottom of the drop down menu).<\/li>\r\n \t<li>In the Normality Test window, select your variable, choose your test, and click ok.<\/li>\r\n<\/ul>\r\n<strong><img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-8.png\" alt=\"\" width=\"572\" height=\"589\" class=\"alignnone size-full wp-image-393\" \/><\/strong>\r\n\r\n<strong>Which test to choose?<\/strong>\r\n<ul>\r\n \t<li><strong>Anderson Darling<\/strong>: quite strongly affected by tied values, which are often encountered in large sample sizes. If you pick the AD test, then make sure you view the probability plot to see if there are tied values.<\/li>\r\n \t<li><strong>Ryan<\/strong> <strong>Joiner (Shapiro Wilk<\/strong>): this test is useful for large sample sizes (&gt;) as it doesn\u2019t react as strongly to tied values<\/li>\r\n \t<li><strong>The Kolmogorov-Smirnov test<\/strong>: Avoid this test unless it is \/illefors corrected: It is not very powerful and will often say data are normal when they are not. If you use it, use a p-value cut-off of 0.10 rather than 0.05<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n\r\n<strong>The null hypothesis is that data are normal, so:<\/strong>\r\n<ul>\r\n \t<li><strong>p &gt; 0.05, data are normal.<\/strong><\/li>\r\n \t<li><strong>p &lt; 0.05, data are signif. diff. from normal.<\/strong><\/li>\r\n<\/ul>\r\nThe output is a probability graph (but without the confidence lines to help interpret) and the results of the statistical test in the small box at top right.\r\n\r\nLook for the <strong>P-Value <\/strong>to determine if data are normal. If p&lt;0.05, it is non-normal.\r\n\r\nOther information provided: mean and SD values as well as the total \u2018N\u2019 and the test value from the statistical test (in this case, AD for the Anderson Darling). <strong>Do not <\/strong>confuse the test statistic with the P-Value. (If you pick the Ryan Joiner test, it will give the RJ value, and KS for Kolgomorov-Smirnov)\r\n\r\n<\/div>\r\n&nbsp;\r\n\r\n<strong>Interpreting P-Value Results from all three statistical normality tests:<\/strong>\r\n<div>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Anderson Darling<\/td>\r\n<td>Ryan- Joiner<\/td>\r\n<td>Kolmogorov\r\n\r\n-Smirnov<\/td>\r\n<td>Note the differences in result here. Recall that the AD test is badly affected by tied values, and it will usually say data are non-normal when they are actually normal if tied values are present. The Ryan Joiner test is better for tied values, and indicates that data are right on the edge of normal (which is similar to what we saw with the skew\/kurtosis calculation). The K-S test can be used as long as the cut-off is 0.10 rather than 0.05, however these data appear non-normal.<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>p = 0.014<\/td>\r\n<td>&gt;0.100<\/td>\r\n<td>0.016<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>To<\/strong> <strong>assess<\/strong> <strong>normality,<\/strong> <strong>always<\/strong> <strong>use<\/strong> <strong>more<\/strong> <strong>than<\/strong> <strong>one<\/strong> <strong>method:<\/strong>\r\n\r\nFor data to be normal, the histogram should look like a bell curve, the normal probability plot should have points falling close to the line, the SE of the skew and kurtosis should fall between -1.96 and +1.96, and the statistical tests should have a p value greater than 0.05 (or 0.10 for the K-S test). How did we do?\r\n\r\n<span style=\"text-decoration: underline\"><strong>Interpretation:<\/strong><\/span>\r\n<table style=\"height: 150px;width: 648px\" width=\"667\">\r\n<tbody>\r\n<tr style=\"height: 60px\">\r\n<td style=\"width: 510.45px;height: 60px\"><\/td>\r\n<td style=\"width: 10.0167px;height: 60px\"><strong>\u00a0<\/strong>\r\n\r\n&nbsp;<\/td>\r\n<td style=\"width: 128.133px;height: 60px\"><strong>\u00a0<\/strong>\r\n\r\n<span style=\"text-decoration: underline\">Conclusion<\/span><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 510.45px;height: 15px\">Histogram: looks a bit skewed, but not too far off<\/td>\r\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\r\n<td style=\"width: 128.133px;height: 15px\">Normal<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 510.45px;height: 15px\">Normal Prob.Plot: the dots fall within the 95% conf.<\/td>\r\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\r\n<td style=\"width: 128.133px;height: 15px\">Normal<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 510.45px;height: 15px\">Skew &amp; Kurtosis: fall within the range<\/td>\r\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\r\n<td style=\"width: 128.133px;height: 15px\">Normal<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 510.45px;height: 15px\">Anderson-Darling: p = 0.014<\/td>\r\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\r\n<td style=\"width: 128.133px;height: 15px\">Non-normal<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 510.45px;height: 15px\">Ryan-Joiner: p &gt; 0.100<\/td>\r\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\r\n<td style=\"width: 128.133px;height: 15px\">Normal<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 510.45px;height: 15px\">Kolmogorov-Smironov: p = 0.016<\/td>\r\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\r\n<td style=\"width: 128.133px;height: 15px\">Non-normal<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>\u00a0<\/strong>The Anderson Darling and K-S tests give a different result than the others, but since we know it is affected by tied values, we don\u2019t use that one since there are lot of tied values in the plot. The other two indicate that data are normal or very close. <strong>Conclusion: Data are normal<\/strong>\r\n\r\n<strong>\u00a0<\/strong><strong>\u00a0<\/strong>\r\n\r\n<strong>Example using non-normal data<\/strong>\r\n\r\nFor comparison, lets look at some obviously non-normal data: The pulses 2 column:\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9.png\" alt=\"\" width=\"1826\" height=\"617\" class=\"alignnone size-full wp-image-394\" \/>\r\n\r\n<\/div>\r\n<div>\r\n\r\n<strong>Interpretation of the statistical Normality tests:<\/strong><strong>\u00a0<\/strong>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Anderson Darling<\/td>\r\n<td>Ryan Joiner<\/td>\r\n<td>Kolmogorov- Smironov<\/td>\r\n<td>This time, all statistical tests indicate that data are non-normal, since the p-values are &lt;0.05 in all cases. Even though there are tied values (so we don\u2019t trust the AD test), the RJ and KS tests are clearly non-normal.<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>p &lt;0.005<\/td>\r\n<td>p&lt;0.01<\/td>\r\n<td>P&lt;0.01<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>\u00a0<\/strong><strong>Evaluating the SE of the Skew and Kurtosis:<\/strong><strong>\u00a0<\/strong>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>Skewness1<\/td>\r\n<td>Kurtosis1<\/td>\r\n<td>SEskew\u00a0= skew \u00f7 \/(6\/n)<\/td>\r\n<td>SEkurt\u00a0 = kurtosis \u00f7 \/(24\/n)<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>1.11<\/td>\r\n<td>1.49<\/td>\r\n<td>=1.11 \u00f7 \u221a (6\/91) = 4.32<\/td>\r\n<td>=1.49 \u00f7 \u221a (24\/91) = 27.67<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>\u00a0Interpretation:<\/strong>\r\n\r\n<\/div>\r\n<div>\r\n\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Conclusion\r\n\r\n<\/div>\r\n<div>\r\n\r\nHistogram: quite skewed\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal\r\n\r\nNormal Prob.Plot: the dots are well off the line\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal\r\n\r\nSkew &amp; Kurtosis: fall outside the range\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal\r\n\r\nAnderson-Darling: p &lt;&lt; 0.05\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal\r\n\r\nRyan-Joiner: p &lt;&lt; 0.05\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal\r\n\r\nKolmogorov-Smironov: p &lt;&lt; 0.05\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal\r\n\r\nThese data (Pulses 2) are clearly non-normal, as all normality tests confirm this.\r\n\r\n<strong>\u00a0<\/strong>\r\n\r\n<strong>\u00a0Comments on interpreting normality testing:<\/strong>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n<ul>\r\n \t<li>Most statistical tests are \u201crobust\u201d to minor violations of normality, so as long as it is close, data can be considered normal for the purposes of doing the statistical tests to find out if groups differ or are related to each other.<\/li>\r\n \t<li>It is usually easy to tell for data that are strongly normal or strongly non-normal<\/li>\r\n \t<li>For datasets where it seems \u201cclose\u201d, the best tests are to just look at the probability plot (with confidence limits) and to check the skew and kurtosis.<\/li>\r\n<\/ul>\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>\u00a0<\/strong>\r\n<h3>Parametric vs Non-parametric testing:<\/h3>\r\n<ul>\r\n \t<li><strong>\u00a0 Parametric tests are better at picking up statistical differences than non- parametric tests, so we prefer to use them when we can\r\n<\/strong><\/li>\r\n<\/ul>\r\n<strong>Assumptions for parametric testing:<\/strong>\r\n<ul>\r\n \t<li><strong>Data are normally distributed (or close to normal) \u2013 test normality<\/strong><\/li>\r\n \t<li><strong>The variances of the groups are similar to each other \u2013 test \u201cequal variances\u201d<\/strong><\/li>\r\n \t<li><strong>Data are independent of each other \u2013\u00a0 study design point<\/strong><\/li>\r\n \t<li><strong>Data were collected in a random fashion \u2013 study design point<\/strong><\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n\r\n<strong>\u00a0<\/strong>\r\n<h3>Statistical Tests<strong>\u00a0<\/strong><\/h3>\r\n<strong>Which do you use? Here is a key to the basic tests<\/strong>\r\n<h3>Dichotomous key to statistical tests<\/h3>\r\n1a. Are you comparing the averages from groups of data?<strong>........................................................................................................................................................................................... 2<\/strong>\r\n\r\n1b. Are you looking for a relationship between two variables<strong>..................................................................................................................................................................................... 11<\/strong>\r\n\r\n2a. Are you comparing the average from one group of numbers to a single predicted value?<strong>......................................................................................................................... 3<\/strong>\r\n\r\n2b. Are you comparing the average from more than one group to averages?<strong>........................................................................................................................................................ 4<\/strong>\r\n\r\n3a. Are your data normally distributed......................................................................................................................................<strong>...................................................One Sample t-test\r\n<\/strong>\r\n\r\n3b. Are your data non-normal? ......................................................................................................................................................................................<strong>One Sample Wilcoxin test<\/strong>\r\n\r\n4a. Are you comparing the averages of two groups?<strong>.........................................................................................................................................................................................................5<\/strong>\r\n\r\n4b. Are you comparing the averages of three or more groups?<strong>.....................................................................................................................................................................................8<\/strong>\r\n\r\n5a. Are your data paired? (i.e. are you measuring something at time a and b on the same individuals?..................<strong>Paired t-test or non-parametric paired test\r\n<\/strong>\r\n\r\n5b. Are your data unpaired (i.e. are you just comparing the average values for your groups?<strong>............................................................................................................................6<\/strong>\r\n\r\n6a. Are your data non-normal?...........................................................<strong>..........................................................................................................................................Mann-Whitney U-test\r\n<\/strong>\r\n\r\n6b. Are your data normally distributed<strong>....................................................................................................................................................................................................................................7<\/strong>\r\n\r\n7a. Are your data normal with equal variance................................................................................................................................................................................<strong>Student\u2019s t-test<\/strong>\r\n\r\n7b. Are your data normally distributed with unequal variance?...............................................................................................<strong>Students t-test, with variance correction<\/strong>\r\n\r\n8a. Are you comparing averages of three or more groups without subgroups?<strong>.......................................................................................................................................................9<\/strong>\r\n\r\n8b. Do your data have a subgroup or factor you want to compare (e.g. response of males and females within different treatment groups)?<strong>................................10<\/strong>\r\n\r\n9a. Are your data normally distributed with equal variance.......................................................................................................................................................<strong>One Way ANOVA<\/strong>\r\n\r\n9b. Are your data QRQ normal or have unequal variance?<strong>.........................................................................................................................................................Kruskall-Wallis Test<\/strong>\r\n\r\n10a. Are your data normally distributed .....................................................................................................................................................................<strong>Two-Way (factorial) ANOVA<\/strong>\r\n\r\n10b. Are your data QRQ normal or have unequal variance?.............................................................................................................................<strong>No simple non-parametric test<\/strong>\r\n\r\n11a. Are your data normally distributed, with error distribution normal and heterogeneous?<strong>.......................................................Pearson Correlation and Regression<\/strong>\r\n\r\n11b. Are your data non-normal?................................................................................................................................................................................................<strong>Spearman Correlation<\/strong>\r\n\r\n<\/div>\r\n<div>\r\n\r\n&nbsp;\r\n<h3>What are the stats telling us? Comparing Groups or Relationships<\/h3>\r\nWe could be comparing groups or looking for relationships among variables.\r\n\r\n<strong>\u00a0<\/strong>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\nOur first step in doing statistical comparisons should be to plot the data with error bars, to get a visual image of what we are comparing.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<h4>Comparing groups<\/h4>\r\nOne of the main types of statistical analyses we do is to compare groups of data. When we do this, we\u2019re really taking the average of the group, and comparing those averages, taking into consideration how variable the data are, and how many samples we have.\r\n\r\n&nbsp;\r\n\r\nThe type of graph we plot depends on the shape of the data (see \u201cFrequency Distributions\u201d, p. 73-75).\r\n<ul>\r\n \t<li>If data are normally distributed, use a bar graph with error bars<\/li>\r\n \t<li>If data are non-normal, use a box and whisker plot<strong>\r\n<\/strong><\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10.png\" alt=\"\" width=\"1826\" height=\"617\" class=\"alignnone size-full wp-image-397\" \/>\r\n\r\nLooking at the relationship between two variables (plot x against y):\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-11.png\" alt=\"\" width=\"901\" height=\"600\" class=\"alignnone size-full wp-image-399\" \/>\r\n<ul>\r\n \t<li>Again, plot the data first, to see what the actual pattern looks like. Then use statistical analysis to see whether the relationship you see with your eye is statistically significant.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Remember<\/strong> <strong>to<\/strong> <strong>test<\/strong> <strong>data<\/strong> <strong>for<\/strong> <strong>normality<\/strong> <strong>before<\/strong> <strong>deciding<\/strong> <strong>which test to use. Since you only have one data set here, you do not also need to test equal variance.<\/strong>\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<h4>Using statistics to compare groups:<\/h4>\r\nWe use different tests depending on the number of groups, and whether data are parametric.\r\n<h5>Comparing one group to a predicted value:<\/h5>\r\n<ul>\r\n \t<li><strong>Parametric: one-sample t-test (compares the mean)<\/strong><\/li>\r\n \t<li><strong>Non-parametric: one-sample Wilcoxin Test (compares the median)<\/strong><strong>\r\n<\/strong><\/li>\r\n<\/ul>\r\nOne sample tests allow us to compare the average and variation in data from an <strong>observed group <\/strong>to a <strong>predicted value<\/strong>.\r\n<ul>\r\n \t<li>The Null Hypothesis is that the mean of your observed data is equal to the predicted value<\/li>\r\n \t<li>If P&lt;0.05, then the means are significantly different.<\/li>\r\n<\/ul>\r\n<strong>Parametric Example:<\/strong> <strong><em>One sample t-test <\/em><\/strong>\r\n\r\n<strong>Study Question: Is the average resting pulse equal to 70 beats per minute?<\/strong>\r\n<ul>\r\n \t<li>H0: Resting pulse (Pulse 1) = 70 beats per minute<\/li>\r\n \t<li>HA: Resting pulse (Pulse 1) \u2026 70 beats per minute\r\n<ul>\r\n \t<li>Choose Stat from the property bar, and then Basic Statistics, and then 1 sample t<\/li>\r\n \t<li>Select your variable (e.g., <strong>pulse 1<\/strong>)<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<strong>Note<\/strong>; if your variable list is blank, just click on the white space in the \u201csamples in columns\r\n<ul>\r\n \t<li>check the box for <strong>\u201cperform hypothesis test<\/strong>\u201d, and type in the value you\u2019re comparing (i.e. 70). Click OK<\/li>\r\n \t<li>Note that we have a simple alternate hypothesis here, of \u201cequal\u201d vs \u201cnot equal\u201d. If you want to be more specific and test if it is greater than or less than your hypothesized mean, click on \u201cOptions\u201d and change it<\/li>\r\n<\/ul>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13.png\" alt=\"\" width=\"395\" height=\"339\" class=\"alignnone size-full wp-image-401\" \/>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Interpretation:<\/strong>\r\n\r\np &lt; 0.05, therefore the mean of the observed resting pulses is significantly different from 70.\r\n\r\n<strong>Trend: <\/strong>Since the mean (from the output) is 73.14, we can say that pulses are significantly higher than 70 beats\/min.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n&nbsp;\r\n\r\n<strong>Non parametric Example:<\/strong>\r\n\r\n<strong><em>One sample Wilcoxin<\/em><\/strong>\r\n\r\nUse this test if data are non-normal.\r\n<ul>\r\n \t<li>Select <strong>Stat<\/strong>, <strong>Nonparametrics, <\/strong>then<strong>1-sample Wilcoxin<\/strong><\/li>\r\n<\/ul>\r\nSet it up the same way as for the 1-sample t-test, but test a median rather than a mean.\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13.png\" alt=\"\" width=\"395\" height=\"339\" class=\"alignnone size-full wp-image-401\" \/>\r\n\r\nNote that this non-parametric test also showed a significant p-value (p=0.028), but that this one is much closer to the 0.05 cutoff than the parametric test, reminding us that this is generally a less powerful test.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Always<\/strong> <strong>choose the parametric test if your data are normal.<\/strong>\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div>\r\n<h5>Comparing 2 Groups<strong>: <\/strong><\/h5>\r\n<em>Study Question: is Pulse 1 different from Pulse 2?<\/em>\r\n\r\n<strong>\u00a0<\/strong><strong>Testing Parametric Assumptions: <\/strong>Before doing this test, remember to test each group of data for normality (p. 87-92). The other assumption for the parametric test is that variances in the groups are similar, so you will also have to do an \u201cequal variance\u201d test. We already know that data for Pulses 1 were normal, but the Pulses 2 data were not. Therefore, parametric test will give an invalid result and the correct test to do here is the non-parametric test. We will test for equal variance on p. 98. Examples with both tests are shown here so you can see how to do them.\r\n\r\n<\/div>\r\n<div>\r\n<h6>Parametric test: <em>Student\u2019s t-test <\/em>(AKA 2-sample t-test)<\/h6>\r\n<strong>Step<\/strong><strong> 1: <\/strong><strong>Look at plot of data to see pattern<\/strong>\r\n\r\n<\/div>\r\n<div>\r\n\r\n<strong>Step<\/strong><strong> 2: <\/strong><strong>Setting up the data table<\/strong>\r\n\r\n<strong>Minitab has two methods:<\/strong>\r\n<ul>\r\n \t<li>The t-test allows us to put the data into adjoining columns(spreadsheet fashion) or to have them in one column. For this example, our Pulse 1 and Pulse 2 data are already in adjoining columns so we\u2019ll pick this option.<\/li>\r\n<\/ul>\r\n<strong>Step 3: <\/strong><strong>do the test: <\/strong>Choose <strong>2 sample t-test <\/strong>from the Stat\/Basic Statistics drop-down menu.\r\n\r\nNote the box here where it says \u201cassume equal variance\u201d ... if you have tested equal variance and they are statistically equal, then check this box.\r\n\r\n<strong>Output for t-test:<\/strong>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-15.png\" alt=\"\" width=\"541\" height=\"658\" class=\"alignnone size-full wp-image-402\" \/>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\nInterpretation: p &lt; 0.05. Therefore, If assumptions are verified (normal + equal variance), then we would conclude that Pulse 1 is OHVV WKDQ Pulse 2.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>\u00a0Assumptions:<\/strong>\r\n\r\n<\/div>\r\n<div>\r\n\r\n1.\u00a0\u00a0\u00a0 Test Normality: Pulse 2 was not normal (see p. 92)<strong>(this<\/strong><strong> means the t-test result was not valid)<\/strong>\r\n\r\n2.\u00a0\u00a0\u00a0 Test Equal Variance: do variance test as described below\r\n\r\n&nbsp;\r\n\r\n<strong>Test<\/strong> <strong>for equal variance for t-test:<\/strong>\r\n<ul>\r\n \t<li>Choose <strong>\u201c2 Variances\u201d <\/strong>from the Stat\/Basic Statistics drop-down menu.<\/li>\r\n \t<li>Select your variables that you are comparing (in this case, I chose the samples in dif. columns, but if your data were set up in a single column the result would be the same; see p. 100 for example). <strong>Click ok<\/strong><\/li>\r\n<\/ul>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-16.png\" alt=\"\" width=\"708\" height=\"491\" class=\"alignnone size-full wp-image-403\" \/>\r\n\r\nMinitab 21 uses Bonnett and Levene Tests and produces some companion graphs. The boxplot shows the range of the data,so we can see that the variation is Pulse 2 is much higher than that for Pulse 1\r\n\r\nThe interval plot shows us the difference in the standard deviation for the two groups, with confidence limits, so we can see that they don\u2019t overlap.\r\n\r\nNote the outputs for the Equal Variance test in the top right corner.\r\n<ul>\r\n \t<li>Both show that p &lt; 0.05, <strong><em>so we reject the null hypothesis <\/em><\/strong>that variances are equal<\/li>\r\n<\/ul>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Important:<\/strong>\r\n<ul>\r\n \t<li>If the data passed the normality test, but failed the equal variance test, you may still be able to do a t-test<\/li>\r\n \t<li>Sample size must be &gt;10<\/li>\r\n \t<li>Leave \u201cassume equal variance\u201d box unchecked.<\/li>\r\n<\/ul>\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nTherefore, we conclude that variances are different\r\n<h5>Interpretation:<\/h5>\r\nThe data did not pass the assumption tests, therefore any result from the t-test is not valid and cannot be trusted\r\n\r\nTherefore, we would discard our t-test result, and carry out a Mann-Whitney U-test.\r\n\r\n<\/div>\r\n<div>\r\n<h4><strong>Nonparametric test:<\/strong> <strong><em>Mann-Whitney<\/em><\/strong><strong><em> U-test <\/em><\/strong>(Use this test if data are not normal)<\/h4>\r\n<strong>Setting up the data:<\/strong>\r\n<ul>\r\n \t<li>You must have your data in separate columns for this test.<\/li>\r\n \t<li>No assumption testing is needed for this test.<\/li>\r\n<\/ul>\r\n<strong>Carry out the test: <\/strong>Choose the Mann-Whitney test from the <strong>Nonparametrics <\/strong>drop down menu (from <strong>Basic Statistics<\/strong>)\r\n<ul>\r\n \t<li>Select your variables, and click ok<\/li>\r\n<\/ul>\r\n<strong>This test compares medians (middle values in a list that is ranked from smallest to largest) rather than means. <\/strong><strong>Minitab Output:<\/strong>\r\n\r\n<strong> <img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-17.png\" alt=\"\" width=\"398\" height=\"651\" class=\"alignnone size-full wp-image-404\" \/><\/strong>\r\n\r\n<strong>Conclusion: <\/strong><strong>no assumptions are required, and there is a signifcant difference among groups, since p &lt; 0.05<\/strong>\r\n\r\n<strong>Trend statement: <\/strong>The pulse rate of students after running (Pulse 2) was significantly higher than the pulse rate before running (Mann-Whitney U-test, P=0.0048).\r\n\r\n<strong>This is the correct test, so we can trust our result.<\/strong>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Since the non-parametric test doesn\u2019t need us to test assumptions, why not just use it all the time?<\/strong>\r\n<ul>\r\n \t<li>When sample sizes are large, the non-parametric and parametric tests often come to the same general conclusion (i.e. the differences are significant or they are not), so it seems like more work to have to do all this extra testing. However, when patterns are not as clear (i.e. when p-values are close to the 0.05 cutoff), or for small sample sizes, it can make a major difference. The Parametric tests have greater <strong>POWER <\/strong>to see a difference if it actually is present, so we always use these ones if we can. Therefore, we must test assumptions first.<\/li>\r\n<\/ul>\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div>\r\n<h4><strong>A second two-sample example, t-test with data in a single column.<\/strong><\/h4>\r\n<strong>Study question: Do males and females have the same average resting pulse?<\/strong>\r\n<ul>\r\n \t<li><strong>\u00a0<\/strong>Choose the 2-sample t-test following the instructions on p. 97 (<strong>Stat,<\/strong><strong> Basic Stats, 2-sample t)<\/strong><\/li>\r\n \t<li><em>When selecting your variable, you\u2019ll be able to choose the \u201csamples in the same column option\u201d in the t- test menu, but you\u2019ll still have to separate them to check for normality. You can do this manually, by splitting the worksheet (see p. 46), or by \u201cunstacking\u201d the column.<\/em><\/li>\r\n<\/ul>\r\n<strong>\u201cUnstacking\u201d a column to test subgroups separately:<\/strong>\r\n<ul>\r\n \t<li>Choose <strong>Data<\/strong> from the upper property bar, then <strong>Unstack<\/strong> <strong>columns <\/strong>from the drop-down menu:<\/li>\r\n<\/ul>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-18.png\" alt=\"\" width=\"445\" height=\"327\" class=\"alignnone size-full wp-image-405\" \/>\r\n\r\n\u201cSuperscripts\u201d refer to your grouping variable, so if you want to separate out the two sexes (male and female), then choose \u201csex\u201d as your subscript.\r\n<ul>\r\n \t<li>Check the boxes for \u201cafter last column in use\u201d and \u201cname the columns\u201d so that the data will appear in your worksheet.<\/li>\r\n<\/ul>\r\n<strong>Test normality<\/strong>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-20.png\" alt=\"\" width=\"1508\" height=\"504\" class=\"alignnone size-full wp-image-406\" \/>\r\n\r\n<\/div>\r\n<div>\r\n\r\nPulse 1 males: The probability plot has most values in the 95% confidence bands, and the p- Value for the Ryan Joiner statistical test is &gt;0.05 (too many tied values for AD)\r\n\r\n<strong>Conclusion: Normal<\/strong>\r\n\r\nPulse 1 females: The probability plot has all values in the 95% confidence bands, and the p-Value for the Ryan Joiner statistical test is &gt;0.05\r\n\r\n<strong>Conclusion: Normal<\/strong>\r\n\r\n<\/div>\r\n<div>\r\n\r\n&nbsp;\r\n\r\n<strong>Test equal variance<\/strong>\r\n\r\n<strong><img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-21.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-407\" \/><\/strong>\r\n\r\n<strong>Conclusion: variance is equal since p&gt;0.05 for both tests<\/strong>\r\n\r\nImportant: do not stop here... these tests only assessed assumptions. Now you must do the test to test your study question.\r\n\r\n<\/div>\r\n&nbsp;\r\n<div>\r\n\r\n&nbsp;\r\n\r\n<strong>Carry out the 2-sample t-test<\/strong><strong>, <\/strong>this time with data in one column.\r\n\r\nSince variances are equal, check box for \u201cAssume equal variances\u201d.\r\n\r\n&nbsp;\r\n\r\n<strong>Two-sample T for pulse1<\/strong>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-22.png\" alt=\"\" width=\"565\" height=\"668\" class=\"alignnone size-full wp-image-408\" \/>\r\n\r\n<strong>Conclusion: <\/strong><strong>Assumptions are satisfied, so test is valid and groups are significantly different since p&lt;0.05.<\/strong>\r\n\r\n<strong>\u00a0<\/strong><strong>Trend statement: <\/strong><strong>Average pulse rate is significantly higher for females (group 2) than for males (group 1) (t-test, p=0.008)<\/strong>\r\n<ul>\r\n \t<li><strong>\u00a0Remember to always write a trend statement giving the clear trend in the data, with the name of the statistical test and the p-value in brackets.<\/strong><strong>\r\n<\/strong><\/li>\r\n<\/ul>\r\n<strong>\u00a0<\/strong>\r\n<table style=\"width: 809px;height: 154px\">\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 794.033px\">\r\n<div>\r\n\r\n<strong>How should you report p-values?<\/strong>\r\n<ul>\r\n \t<li>When possible, report the actual p-value if you have it from the computer test<\/li>\r\n \t<li>Rule of thumb: <strong>only report to a max. of 3 decimal places<\/strong>:<\/li>\r\n \t<li>If the p is given as 0.000 or 0.0000 then write as p&lt;0.001\r\n<ul>\r\n \t<li>Even if the computer reports a value like 0.000, <strong>NEVER <\/strong>say p=0.000, since probability is always higher than zero<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Interesting side note:<\/strong>\r\n\r\nThese questions illustrate how many questions can come out of a single dataset, if the sample size is large enough, and the study design incorporated many variables. In the previous example, we tested whether two different groups of students were different from each other. In this example, we are testing to see whether an individual group of students can show significant differences from one time\r\n\r\nto another. For this to work, the data must be \u201cpaired\u201d... i.e. we must identify each individual and be sure we know his\/her before and after result.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>Two sample testing when values are \u201cPaired\u201d <\/strong><strong><em>Study<\/em><\/strong><strong><em> question Did the pulse rates of the individual students go up after running?<\/em><\/strong>\r\n\r\n<strong>Note<\/strong> <strong>the<\/strong> <strong>importance<\/strong> <strong>of<\/strong> <strong>how the question is worded:<\/strong>\r\n\r\nIf we want to know if the average pulse rate goes up after running, we would do a regular t-test or Mann-Whitney U-test (if non-normal). But because we have data on each individual person, we can see if the individual rates go up by using a test that focuses on individual responses, called a \u201cpaired\u201d test. This is particularly useful if high variability in individuals makes it difficult to see a pattern in the average response, and the paired test will have more power to see the differences than the regular one.\r\n\r\n<strong>How<\/strong> <strong>Paired tests work:\u00a0 <\/strong>In paired tests, we look at a measured value for known individuals at two (or more) times.\r\n\r\nThe example below is for the pulses in the group who ran in place for one minute, so you can see that their pulse rates all went up, though not by the same amount.\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-24.png\" alt=\"\" width=\"1408\" height=\"276\" class=\"alignnone size-full wp-image-409\" \/>\r\n\r\nWe can test this using a one-sample test, to see if the group of numbers in the difference column is significantly different from zero.\r\n\r\n<strong>For parametric data, there is a t-test that calculates the differences and does the one sample test automatically in a single step. For non-parametric data, we have to do the extra step ourselves.<\/strong>\r\n\r\n<strong>Step 1: <\/strong>separate out the runners from the non- runners in both our resting pulse and our after running pulse columns (pulse 1 and pulse 2). <strong>This will give us four groups of data as you can see from the graph at right.<\/strong>\r\n\r\n<strong>Remember<\/strong> <strong>to<\/strong> <strong>plot<\/strong> <strong>your<\/strong> <strong>data<\/strong> <strong>so<\/strong> <strong>you<\/strong> <strong>can<\/strong> <strong>see<\/strong> <strong>what you are comparing!<\/strong>\r\n<ul>\r\n \t<li>We can do this manually (by copying and pasting into additional columns, \u201csplitting\u201d the worksheet into multiple smaller worksheets to work on using Minitab or \u201cunstacking\u201d the columns.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-25.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-411\" \/>\r\n\r\nFigure 1. Comparison of pulse rates for a group of students before and after one group ran for 1 Min.\r\n\r\n<\/div>\r\n<div>\r\n\r\n<strong>Step 2: Compare pulse 1 and pulse 2 for the non-runners (control) and then the runners (test)<\/strong>\r\n\r\n<strong>First: Test assumptions:<\/strong>\r\n\r\n<strong>1. <\/strong><strong>Normality testing<\/strong>: test normality of all four groups, using Ryan Joiner (since many tied values)\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-26.png\" alt=\"\" width=\"1319\" height=\"871\" class=\"alignnone size-full wp-image-412\" \/>\r\n\r\n<strong>2.\u00a0\u00a0\u00a0 <\/strong><strong>Equal variance<\/strong>: Running group p=0.004\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-running group: p=0.624\r\n\r\n(Variances between before &amp; after running: non-equal for runners, equal for non-runners)\r\n\r\n<strong>Conclusion from assumptions:<\/strong>\r\n<ul>\r\n \t<li>Runners: one group was normal the other non normal; variances were not equal<\/li>\r\n \t<li>Non-runners: data were normal, and variances were equal<\/li>\r\n<\/ul>\r\nTherefore, both the parametric and non-parametric paired tests are needed to assess whether there are differences between the individuals pulse rates at the two different times.\r\n\r\n<\/div>\r\n<div>\r\n<h4><strong>Parametric data - use the Paired t-test<\/strong><\/h4>\r\nRun this test using the Non-runners\u2019 pulse rate data, since these data were normal and had equal variance.\r\n\r\n<strong>Make sure the data to be tested are in separate columns, e.g. non-runners at time 1 and non-runners at time 2.<\/strong>\r\n<ul>\r\n \t<li>Select <strong>Stat <\/strong>from the property bar, and <strong>Basic Statistics.<\/strong><\/li>\r\n \t<li>Choose the <strong>paired t test <\/strong>from the drop down menu.<\/li>\r\n \t<li>Choose variables so that your data for the non-runners from time 1 is being compared to non-runners in time 2.<\/li>\r\n<\/ul>\r\n<strong>Minitab Output:<\/strong>\r\n\r\n&nbsp;\r\n\r\n<strong>Paired T-Test and CI: pulse1nonran, pulse2nonran<\/strong><strong>\u00a0<\/strong>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Since p&gt;0.05, we accept the null hypothesis, and say there is no difference in the groups.<\/strong>\r\n\r\nNotice the difference here in what is being tested: instead of comparing one mean to another, it tests whether the mean difference is equal to zero\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div>\r\n<h4><strong>Non-parametric paired two sample test:<\/strong><\/h4>\r\n<ul>\r\n \t<li>If data are non-normal, then you must use a non-parametric test. There is no one-step non-parametric paired test, but you can carry it out in two steps.<\/li>\r\n<\/ul>\r\n<strong>Step one: <\/strong>Subtract your time 1 column from your time 2 column (as in the table on p. 103). You can do this in Excel, and copy and paste the values into Minitab.\r\n\r\n<strong>Step two: <\/strong>Carry out a one-sample Wilcoxin test to test whether your median is different from zero (as shown on p. 96).\r\n\r\nChoose your column with the runner differences, and check the box for testing that the median is zero:\r\n\r\nMinitab Output:\r\n\r\n<strong> <img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-28.png\" alt=\"\" width=\"246\" height=\"168\" class=\"alignnone size-full wp-image-416\" \/>\r\n<\/strong>\r\n\r\n<strong>Wilcoxon Signed Rank Test: run difs<\/strong>\r\n\r\n<strong>Since p&lt;0.001, then we reject the Null hypothesis; i.e. the pulse rates are significantly different between time 1 and time 2.<\/strong>\r\n\r\n<\/div>\r\n&nbsp;\r\n<div>\r\n<h4><strong>Comparing &gt;2 groups of data<\/strong><\/h4>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Balanced<\/strong> <strong>Designs<\/strong>\r\n\r\nSome packages (e.g. Excel) assume a \u201cbalanced design\u201d; that means that there have to be the same number of observations in each group being studied. In Minitab, a balanced design is not necessary for simple ANOVA (although it is in 2-way ANOVA), but note that if your groups have too few observations, you have very little \u201cpower\u201d to pick out differences.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nAnalysis of Variance (ANOVA) or the non- parametric equivalent (Kruskall-Wallis) allows us to compare more than 2 groups of data. As with the t-test, we are comparing means, and the null hypothesis is that the means are equal.\r\n<ul>\r\n \t<li><strong>If p&lt;0.05, the means are statistically different (assuming the assumptions of ANOVA are met).<\/strong><\/li>\r\n<\/ul>\r\n<strong>Setting up your data:<\/strong>\r\n\r\nYour dataset can be set up in the worksheet so that the responses being measured are all in one column, and the groups that you are comparing are given as categories in a separate column (See our pulses dataset on p. 83 as an example). Alternatively, the data can be set up so your groups are in adjoining columns. For the ANOVA, these have special names: If data are in one column, Minitab refers to this as \u201c<strong>stacked<\/strong>\u201d. If data are in adjoining columns, Minitab refers to this set-up as \u201c<strong>unstacked<\/strong>\u201d.\r\n<h5>Parametric Data: Analysis of Variance (ANOVA)<\/h5>\r\n<strong>One-Way Design <\/strong>(comparing groups without subgroups)\r\n\r\nExample of comparing three or more groups of data; \u201cstacked\u201d data. <strong>Study Question: <em>Are resting pulse rates different depending on normal activity level?<\/em><\/strong>\r\n<ul>\r\n \t<li>This will give three groups to compare, with three activity levels.<\/li>\r\n<\/ul>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\nImportant design note: The design of this study is highly unbalanced, and the group reporting low activity level was very small in number; it may be too small to give random and independent data. Therefore, ANOVA result should be treated with caution.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nAssumptions:\r\n\r\nANOVA assumes that data are parametric, which means:\r\n<ul>\r\n \t<li>Normally distributed<\/li>\r\n \t<li>Variances are equal<\/li>\r\n \t<li>Design should be set up so data are independent and random.<\/li>\r\n<\/ul>\r\n<strong>1. <\/strong><strong>Test for normality <\/strong>for each group of data as explained in the earlier section\r\n<ul>\r\n \t<li>When this was done, all groups were normal (p&gt;0.05)<\/li>\r\n<\/ul>\r\n<strong>2. <\/strong> <strong>Equal Variance Test<\/strong>: <strong>Do not use the equal variance test you used for t-tests!<\/strong>\r\n<ul>\r\n \t<li>To test equal variance on three or more groups of data, we need to use the equal variance test found in the ANOVA menu. The one in the t-test menu only tests variance for two groups of data, not for three or more.<\/li>\r\n<\/ul>\r\nSelect <strong>Stat <\/strong>from the property bar, then <strong>ANOVA<\/strong>. Look part way down the drop-down menu, and select <strong>Test for Equal Variances<\/strong>\r\n\r\nChoose your response variable (in this case, pulses 1 to assess the resting pulse rates) and your factor variable (in this case, activity level), and click okay.\r\n\r\n<strong>Equal variances result:<\/strong>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-30.png\" alt=\"\" width=\"1230\" height=\"866\" class=\"alignnone size-full wp-image-420\" \/>\r\n\r\n<\/div>\r\n<div>\r\n\r\np&gt;0.05 so variances are equal\r\n\r\nNotice in the graph how the variances are all very similar with a lot of overlap... that shows that the variances are very similar.\r\n\r\n<strong>Conclusion: <\/strong>Although we need to be cautious about interpreting patterns about the low activity level due to the small and unbalanced sample size, our data meet the assumptions of the ANOVA, so we can run the test.\r\n\r\n<strong>Method for One-Way ANOVA:<\/strong>\r\n<ul>\r\n \t<li>Choose \u201c<strong>stat<\/strong>\u201d from the property bar, then choose \u201c<strong>ANOV<\/strong>A\u201d, then choose \u201c<strong>one-way<\/strong>\u201d. One way analysis of variance means that you are simply comparing 3 or more groups of data, without any subgroups in them.<\/li>\r\n<\/ul>\r\nThis puts you into the menu for the ANOVA with data set up in one column\r\n\r\n<strong>The<\/strong><strong> Null hypothesis <\/strong>is that\r\n\r\nX1 = X2 = X3\r\n\r\ntherefore, a significant p-value (p&lt;0.05) means that at least one group is different from the others.\r\n\r\n<\/div>\r\n<div>\r\n<ul>\r\n \t<li><strong>Select your variables<\/strong>\r\n<ul>\r\n \t<li>The \u201c<strong>response<\/strong>\u201d variable is the one where your measurements are, for example one of the columns of pulse rates.<\/li>\r\n \t<li>The \u201c<strong>factor<\/strong>\u201d is the column where the different groupings or levels are shown.<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n\r\nIn our pulses dataset, we have 3 levels of activity, so we could choose activity level as our factor.\r\n\r\n<strong>Click<\/strong> <strong>on<\/strong> <strong>\u201cokay\u201d,<\/strong> <strong>and<\/strong> <strong>the<\/strong> <strong>output<\/strong> <strong>will look like:<\/strong>\r\n\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n<strong><img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-31.png\" alt=\"\" width=\"532\" height=\"669\" class=\"alignnone size-full wp-image-421\" \/><\/strong>\r\n\r\n<strong>Interpretation: <\/strong>p&gt;0.05, so the means <strong>are not significantly different <\/strong>between the groups. We conclude that activity level <strong>does not <\/strong>have an effect on resting pulse in these students.\r\n\r\n<\/div>\r\n<strong>Remember to always plot your data <\/strong>to see the patterns. This helps you to determine what you actually want to test, and can help you interpret your data patterns. You can see from the graph at right that although it looks like the people with low activity level had a higher resting pulse rate, the variability in the data means that there is no significant difference\r\n<div><\/div>\r\n<div>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-32.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-422\" \/>\r\n\r\n<strong>Conclusion: we have done the appropriate<\/strong> <strong>test, so we can trust our result.<\/strong>\r\n\r\n<strong>Trend Statement: <\/strong><strong>There is no significant difference among resting pulses in the three groups (ANOVA, p = 0.155).<\/strong>\r\n<h5><strong>Non-parametric data: Kruskall-Wallis Test<\/strong><\/h5>\r\n<\/div>\r\n<div>\r\n\r\nIf data were <strong>not <\/strong>normal [and could not be made normal through transformation], we would use the Kruskal-Wallis test, which compares medians.\r\n\r\n<strong>Data Setup:<\/strong>\r\n<ul>\r\n \t<li>Data must be set up so that the response variable (in this case, Pulse rate) is in one column, and the grouping variable is in another column.<\/li>\r\n \t<li>Select \u201c<strong>non-parametrics<\/strong>\u201d from the \u201c<strong>stats<\/strong>\u201d menu, and choose the Kruskal-Wallis.<\/li>\r\n \t<li>Select your variables:<\/li>\r\n<\/ul>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Transforming<\/strong>:\r\n<ul>\r\n \t<li>If your data are not normal and don\u2019t have equal variance, you can use the non-parametric test, or you can try transforming the data. Common transformations are the <strong>log transformation, square root transformation, and inverse tranformation <\/strong>\u2013 To transform, convert all values in each column of data being analysed to the transformation, and try your analyses again.<\/li>\r\n \t<li>Remember to always report your actual data (in the text or in graphs) and not the transformed data<\/li>\r\n<\/ul>\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n&nbsp;\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-33.png\" alt=\"\" width=\"500\" height=\"608\" class=\"alignnone size-full wp-image-423\" \/>\r\n\r\n<strong>Trend statement: <\/strong><strong>There is no significant difference among the pulse rates for the students who have the different activity levels (Kruskal-Wallis test, p=0.153)<\/strong>\r\n\r\n<\/div>\r\n<div>\r\n<h4><strong>Example of comparing three or more groups of data; \u201cun-stacked\u201d data<\/strong><\/h4>\r\n<strong>Study Question: <\/strong><em>Is there a difference among the runners and non-runners, before and after running?<\/em>\r\n\r\nRunning Group\r\n\r\n<strong>Data Setup: <\/strong>For this method, put the data into 4 separate columns, and run a <strong>One-Way ANOVA<\/strong>, <strong>unstacked<\/strong>.\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-34.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-425\" \/>\r\n\r\nA quick plot of the data shows no dif. in pulse between time 1 and 2 for the non-runners, but there seems to be a big difference for the runners. What we would like to know is whether that difference is significant.\r\n\r\n<strong>The null hypothesis is that there is no difference among groups.<\/strong>\r\n\r\n<strong>\u00a0<\/strong><strong>Assumptions<\/strong>:\r\n<ul>\r\n \t<li><strong>Normality: <\/strong>some groups were normal, and others were not. The non-normal groups were relatively close to normal.<\/li>\r\n \t<li><strong>Variances: <\/strong>were not equal<\/li>\r\n<\/ul>\r\n<strong>Conclusion from Assumptions:<\/strong>\r\n\r\nData are non-parametric <strong>(if<\/strong> <strong>even<\/strong> <strong>one<\/strong> <strong>group<\/strong> <strong>you<\/strong> <strong>are<\/strong> <strong>comparing<\/strong> <strong>don\u2019t<\/strong> <strong>fit<\/strong> <strong>assumptions,<\/strong> <strong>then<\/strong> <strong>your<\/strong> <strong>data<\/strong> <strong>are<\/strong> <strong>nonparametric)<\/strong>, so should either be transformed or a non-parametric test should be chosen. However, ANOVA is \u201crobust\u201d to minor violations of assumptions, so since data are close to normal, it may be okay to do the parametric test. To be sure, do both the parametric and non-parametric test and compare.\r\n\r\n<strong>ONE-WAY ANOVA, \u201cunstacked\u201d<\/strong>\r\n<ul>\r\n \t<li>Select <strong>Stat <\/strong>from the property bar, then choose <strong>ANOVA <\/strong>from the drop down menu.<\/li>\r\n \t<li>Choose <strong>One-Way (Unstacked)<\/strong><\/li>\r\n \t<li>Select variables (these must be in adjoining columns in your dataset)<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n\r\n<strong>Output (ANOVA table)<\/strong>\r\n\r\n<strong> <img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-35.png\" alt=\"\" width=\"675\" height=\"668\" class=\"alignnone size-full wp-image-426\" \/><\/strong>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Interpretation: <\/strong>p&lt;0.05, so there is a significant difference in the groups. However, we don\u2019t know which groups are different.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>Important<\/strong><strong>\u00a0<\/strong><strong>note:<\/strong>\r\n\r\np&lt;0.05, so groups are significantly different. However, there is no way to know from the simple One-Way ANOVA which group is different from which other(s). We need to do further testing to figure this out.\r\n\r\nMultiple Comparison Tests:\r\n\r\n<strong>\u00a0<\/strong><strong>ANOVA is designed to tell us if there are any differences, but not which groups are different. For this: do Multiple Comparison Tests<\/strong><strong>\u00a0<\/strong>\r\n<table style=\"height: 259px\">\r\n<tbody>\r\n<tr style=\"height: 259px\">\r\n<td style=\"height: 259px;width: 1364.03px\">\r\n<div>\r\n\r\n<strong>Type<\/strong><strong> I Error Inflation:<\/strong>\r\n\r\n<strong>\u00a0<\/strong>\r\n\r\nRecall that if we do multiple comparisons on the same data set (e.g. <strong>group 1 vs group 2<\/strong>, <strong>group 1 vs group 3 <\/strong>and <strong>group 2 vs group 3<\/strong>) then our probability of getting an incorrect interpretation goes up. If you have three groups, that probability goes up from 0.05 to 0.14 for all the comparisons taken together.\r\n\r\n<strong>\u00a0<\/strong>\r\n\r\nThis means that if we want to compare the individual groups, we need to use a special test that calculates a \u201cfamily error rate\u201d (the error rate for all the groups considered together) rather than individual error rates. These are called \u201c<em>post-hoc\u201d<\/em> or \u201cmultiple comparison\u201d tests.\r\n\r\n<strong>\u00a0<\/strong>\r\n\r\nIf you find a significant result with your ANOVA (i.e. if you run ANOVA and your p&lt;0.05), then you should run a multiple comparison test to see which of the groups are significantly different from each other.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div>\r\n\r\nThe most commonly used multiple comparison test is Tukey\u2019s Test. To do a Multiple Comparison test, click on \u201ccomparisons\u201d in the initial ANOVA menu\r\n<ul>\r\n \t<li>Check the box for Tukey\u2019s test, and click OK<\/li>\r\n \t<li>Then run the ANOVA as before. (You will see the ANOVA output, and some additional information that lets you compare each group)<\/li>\r\n<\/ul>\r\nOutput:\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-36.png\" alt=\"\" width=\"638\" height=\"641\" class=\"alignnone size-full wp-image-427\" \/>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Interpretation: <\/strong>Look for groupings that have a different letter under the \u201cGrouping\u201d heading. In the example above, the pulse 1 and pulse 2 groups for the non-runners, and the pulse 1 group for the runners all have the same letter, so they are not significantly different. However, the pulse 2 data for the runners has a different letter, so that means it is significantly different than all the other groups. To find out how it is different, look at the column with the means: Pulse2ran clearly has a higher mean value than the other ones.\r\n\r\n&nbsp;\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>Therefore, to report this trend, you\u2019d write something like:<\/strong>\r\nThere was a significant difference in pulse rates in the groups of students (ANOVA, p &lt;0.001). There was no difference in resting pulses of the two groups of students (runners vs non runners) prior to running, but there was a significant increase in pulse rate in the running group after running (Tukeys test, p&lt;0.05).\r\n<h5>Comparing &gt;2 groups of data with subgroups<\/h5>\r\n<\/div>\r\n<div>\r\n\r\n<strong>Factorial Analysis of Variance\u00a0 <\/strong><strong>(This can be a 2-way ANOVA, 3-Way, etc.)<\/strong>\r\n\r\n<strong>\u00a0<\/strong>If your data contain distinct subgroups, you can run an ANOVA that lets you test the effects of those subgroups, or factors, on your response at the same time as testing your main grouping factor. This is called a factorial analysis.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Important note: Minitab requires a \u201cbalanced design\u201d for 2-way ANOVA<\/strong>\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nConsider an example where a researcher would like to know whether a particular feed supplement would increase growth in chickens, and whether the sex of the chicken would affect how it worked. This is a standard \u201ctwo-way\u201d design, where we are looking at two factors at the same time.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Data<\/strong> <strong>Set-up<\/strong>\r\n\r\nSet up the data in Minitab so that the response data (weight) are in one column, and the grouping factors are in other columns:\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\nThis is known as a factorial table. If your data can be set up in this way to show groupings, then it is a good candidate for a two-way ANOVA.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nTable 1. Weight (grams) after two weeks in a group of bantam chicks on two different diets; the standard diet, and one supplemented by blueberry extract.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>&nbsp;\r\n\r\n&nbsp;\r\n\r\nmales<\/td>\r\n<td>supplement\r\n\r\n&nbsp;\r\n\r\n590<\/td>\r\n<td>control\r\n\r\n&nbsp;\r\n\r\n440<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td>530<\/td>\r\n<td>570<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td>550<\/td>\r\n<td>509<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td>570<\/td>\r\n<td>510<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td>650<\/td>\r\n<td>589<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>females<\/td>\r\n<td>530<\/td>\r\n<td>550<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td>580<\/td>\r\n<td>420<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td>520<\/td>\r\n<td>440<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td>520<\/td>\r\n<td>520<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><\/td>\r\n<td>560<\/td>\r\n<td>370<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div>\r\n\r\n<strong>Assumption<\/strong><strong> Testing:<\/strong>\r\n<ul>\r\n \t<li><strong>Normality (A-D): <\/strong>\r\n<ul>\r\n \t<li>control, female, p=0.64<\/li>\r\n \t<li>control, male, p=0.52<\/li>\r\n \t<li>supplement, female, p=0.21<\/li>\r\n \t<li>supplement, male, p=0.59<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<strong>All data are normally distributed<\/strong>\r\n\r\n<strong>Equal Variance:<\/strong>\r\n<ul>\r\n \t<li>Use the \u201cTest for equal variances\u201d in the ANOVA menu.<\/li>\r\n \t<li>Set it up as shown below:<\/li>\r\n<\/ul>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-37.png\" alt=\"\" width=\"731\" height=\"492\" class=\"alignnone size-full wp-image-428\" \/>\r\n\r\n<strong>Equal Variance Output:<\/strong>\r\n\r\n<strong>p&gt;0.05, therefore accept the null <\/strong><strong>that variances are equal.<\/strong>\r\n\r\n<\/div>\r\n<div>\r\n\r\n<strong>Now<\/strong> <strong>run the test:<\/strong>\r\n<ul>\r\n \t<li><strong>\u00a0<\/strong>Select <strong>Stat <\/strong>from the property bar, then choose <strong>ANOVA,<\/strong> then <strong>Two-Way<\/strong><\/li>\r\n \t<li><strong>\u00a0<\/strong>Select your variables:\r\n<ul>\r\n \t<li><strong>Response = <\/strong>the data column, so, <strong>weight Row and column factors:<\/strong><\/li>\r\n<\/ul>\r\n<\/li>\r\n \t<li>Before doing a 2-way ANOVA, you should set up a table to assess your factors<\/li>\r\n \t<li>Use that to decide which is a row factor and which is a column factor.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n\r\n&nbsp;\r\n\r\n<strong>First, plot the data to see the patterns you want to test:<\/strong>\r\n\r\n<\/div>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-38.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-430\" \/><strong>\r\n<\/strong>\r\n<div>\r\n\r\n<strong>Two Way ANOVA Output:<\/strong>\r\n\r\n<strong> <img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-39.png\" alt=\"\" width=\"485\" height=\"410\" class=\"alignnone size-full wp-image-431\" \/><\/strong>\r\n<ul>\r\n \t<li><\/li>\r\n<\/ul>\r\n<strong>Interpretation:<\/strong>\r\n\r\n<strong>Step 1: look at p-values for individual factors<\/strong>\r\n\r\nSex: p=0.051\r\n<ul>\r\n \t<li>Since p&gt;0.05, we accept the null and conclude there is no difference Food: p = 0.011\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 since p&lt;0.05, we reject the null and conclude there is a difference<\/li>\r\n<\/ul>\r\n<strong>Step 2: look at the p-value for interaction<\/strong>\r\n\r\nInteraction: p=0.577,\r\n<ul>\r\n \t<li>Since p&gt;0.05, there is no interaction with respect to the weight response among the factors (i.e. weight acted the same way for both sexes (was lower) regardless of the food type).<\/li>\r\n<\/ul>\r\n<strong>Interaction<\/strong> refers to whether the response acts the same way for the different levels of the factors.\r\n<ul>\r\n \t<li>e.g. If the growth was higher on blueberries for males, and lower for blueberries for females, that would be an example of a different response for the two sexes... This would give a significant interaction.<\/li>\r\n \t<li>Since growth was lower for females than males for both food types, it means there was no interaction.<\/li>\r\n<\/ul>\r\n<strong>To see the actual trend, we look at the graph:<\/strong>\r\n\r\nChickens that were fed a diet supplemented by blueberry extract grew significantly larger than those on a standard diet (Two-Way ANOVA, p=0.012), and there was no significant difference in response between male and female chicks (Two-Way ANOVA, p=0.057). There was no significant interaction between food type and sex of chicks (Two-Way ANOVA, p=0.58), indicating that the food supplement affected weight in the same way for male and female chicks.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\nRemember to always say which is higher\/lower than the other, if you can\r\n\r\nTo report the interaction, be sure you explain it in English - do not just say there is a significant interaction or not.\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div>\r\n<h5>Relationships between variables: Regression and Correlation<\/h5>\r\nCorrelation analysis tests whether there is a correlation between two continuous variables Regression analysis takes it a step further to give us the equation of the line and give information on how much of the variation in the points can be statistically related to the other variable.\r\n\r\n<strong>Step 1: Plot the data to see if there is a relationship between the variables <\/strong><strong>e.g. Is Weight related to Height of the students in the pulses study? If so, what is the equation of the line.<\/strong>\r\n\r\n<strong>\u00a0<\/strong>The graph suggests that the weight of the students increases as their heights increase.\r\n\r\n<\/div>\r\n<div><\/div>\r\n<div>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-40.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-432\" \/>\r\n\r\n<\/div>\r\n<div>\r\n\r\nWe can run <strong>correlation <\/strong>analysis to see how strongly related (correlated) the variables are, and we can run <strong>regression <\/strong>analysis to get the equation of the \u201cbest\u201d line that describes the relationship.\r\n\r\nWe can use <strong>regression <\/strong>to describe relationships, or to generate an equation (usually called a \u201cmodel\u201d that can be used to predict for values\r\n\r\n<strong>Assumptions: Correlation\r\n<\/strong>\r\n<ul>\r\n \t<li>Each variable must be normally distributed<\/li>\r\n \t<li>The relationship must be linear<\/li>\r\n \t<li>The residuals (errors) must be normally distributed<\/li>\r\n<\/ul>\r\n<strong>Assumptions: Regression<\/strong>\r\n<ul>\r\n \t<li>Each variable must be normally distributed<\/li>\r\n \t<li>The relationship must be linear (for linear regression)<\/li>\r\n \t<li>The residuals must be evenly distributed along the line<\/li>\r\n<\/ul>\r\n<strong>Some<\/strong> <strong>important<\/strong> <strong>definitions:<\/strong>\r\n<ol>\r\n \t<li>Independent variable: the variable that is fixed for our analysis... what we are comparing the other variable to plot this variable on the x-axis<\/li>\r\n \t<li>Dependent variable: the variable that \u201cdepends on\u201d the x-axis variable; the one we expect to change or vary with our fixed variable. Plot this variable on the y-axis<\/li>\r\n \t<li>Residual: the amount of vertical (y-axis) variation of the observed points from the regression line (i.e. the \u2018y\u2019 value of your point minus the \u2018y\u2019 value on your line)<\/li>\r\n \t<li>Standardized residual: (residual) \u00f7 (standard deviation of residual). Standardized residuals have been standardized to have a variance of 1. Standardized residuals over 2 are usually considered large, and points with that large a residual are often considered to be outliers outside of our observed range.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div>\r\n\r\n<strong>Method:<\/strong>\r\n<ul>\r\n \t<li>Test normality as described before<\/li>\r\n \t<li>Test linearity by looking at the graph and deciding if the points form a straight line<\/li>\r\n \t<li>Test the normality and linearity of the residuals through specialized menus in the Regression menu.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n<h5><strong>Correlation<\/strong><\/h5>\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-41.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-433\" \/>\r\n\r\n<\/div>\r\n<div>\r\n\r\n<strong>The graph above shows a relationship between height and weight of the student participants.<\/strong>\r\n<ul>\r\n \t<li>How related are they?<\/li>\r\n \t<li>Is the relationship significant?<\/li>\r\n<\/ul>\r\nTo test these questions, we run a correlation analysis.\r\n\r\n<strong>The correlation analysis gives us two statistics, <\/strong>the correlation coefficient, and the p-value\r\n\r\n<\/div>\r\n<div>\r\n\r\n<strong>Correlation Coeffient <\/strong>( r ) tells you how related the two variables are on a scale of zero to one\r\n<ul>\r\n \t<li>A negative \u2018r\u2019 means the relationship is negative (slopes downward)<\/li>\r\n \t<li>A positive \u2018r\u2019 means the relationship is positive (slopes upwards)<\/li>\r\n \t<li>A value of 1 or -1 means 100% related<\/li>\r\n \t<li>General rule of thumb is that a \u201cgood\u201d correlation is 0.7 to 1.0 or -0.7 to -1.0<\/li>\r\n<\/ul>\r\n<strong>P-value: <\/strong>Tells you whether the slope of your line is significantly different from zero (i.e do you have a significant relationship. If p&lt;0.05, the variables are significantly correlated\r\n\r\n<strong>Carry out the test:<\/strong>\r\n<ul>\r\n \t<li>Select <strong>Stat <\/strong>from the property bar, then choose <strong>correlation <\/strong>from the <strong>Basic statistics <\/strong>drop down menu.<\/li>\r\n<\/ul>\r\n[<strong>Note: <\/strong>do correlation analysis only if you don\u2019t need the equation of the line, or the coefficient of determination (r2)]\r\n\r\n&nbsp;\r\n\r\nWith correlation, it doesn\u2019t matter what order you put the variables in, since we\u2019re just looking at a simple relationship.\r\n\r\n<strong>Output from Minitab:<\/strong><strong>\u00a0<\/strong>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>Correlations: height, weight<\/strong>\r\n\r\nPearson correlation of height and weight = 0.786 P-Value = 0.000\r\n\r\nremember to write as p&lt;0.001\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>Interpretation:<\/strong>\r\n<ul>\r\n \t<li>The r-value = 0.786, which tells us that we have a positive correlation, and it is a \u201cgood\u201d correlation (since the value is above 0.7) tells us how strong it is.<\/li>\r\n \t<li>The p-value tells us if it is a significant correlation, while the r-value<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n\r\n<strong>Assumptions were satisfied:<\/strong>\r\n\r\nBoth sets of data are normal (based on prob. plots) and the relationship is linear (based on the graph)\r\n\r\n<\/div>\r\n<div>\r\n<h5>Regression:<\/h5>\r\nRegression analysis lets us determine the \u201cmodel\u201d (line) that best describes the data.\r\n\r\nThe regression analysis gives us three statistics:\r\n<ul>\r\n \t<li>\u00a0r2 =\u201ccoefficient of determination\u201d<\/li>\r\n \t<li>The p-value, and<\/li>\r\n \t<li>The equation of the line.<\/li>\r\n<\/ul>\r\nThe p-value is the same as for correlation\r\n\r\n<\/div>\r\n<div>\r\n\r\nThe <strong>Coefficient of determination, r<\/strong><strong>2<\/strong><strong>, <\/strong>tells us the proportion of the variation in the \u2018y\u2019 vaues that can be directly related(\"statistically\u201d) to the x-value. It is often referred to as the amount of variation \u201cexplained\u201d by the variation in \u2018x\u2019, but note that that is meant in the statistical sense.\r\n\r\n<\/div>\r\n<div>\r\n\r\nIn regression, we assume that one variable is fixed or independent, and that the other variable depends on the first. The independent variable goes on the x-axis and the dependent one goes on the y-axis.\r\n\r\n<strong>The assumptions of linear regression are:<\/strong>\r\n<ol>\r\n \t<li>Linearity: data on the graph form a reasonable line, so we say they are linear<\/li>\r\n \t<li>Data normality: Normality plots of the two variable showed that both were normally distributed<\/li>\r\n \t<li>Error normality (normality of residuals) - test this after running the regression<\/li>\r\n \t<li>Equal distribution of residuals along the line - test after running the regression<\/li>\r\n<\/ol>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>A note on regression assumptions:<\/strong>\r\n\r\nUnlike the assumptions for the group comparisons (e.g. ANOVA or t-test), many of the Regression are there to guide your interpretation, not to be an absolute guide to whether you can do the test or not.\r\n\r\n<strong>\u201cMust<\/strong> <strong>have\u201d<\/strong> asummptions:\r\n<ul>\r\n \t<li>Data must be linear for Linear Regression<\/li>\r\n \t<li>Data must be normal or near normal<\/li>\r\n<\/ul>\r\n<strong>Assumptions<\/strong> <strong>that<\/strong> <strong>guide<\/strong> <strong>interpretation:<\/strong>\r\n<ul>\r\n \t<li>Spread of residuals:\r\n<ul>\r\n \t<li>Residuals are the distance that each dependent variable point is away from the line (on the y-axis).\r\n<ul>\r\n \t<li>For the regression equation to have good predictive ability (i.e. so you can predict values of y if you know values of x), the residuals have to be normally distributed and distributed equally above and below the line for the entire relationship. For example, if there is low variability (scatter) at the lower part of the graph, and high variability (scatter) at the high part of the graph, it means that the line doesn\u2019t predict as well at the upper parts of the graph.<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n- i.e. Even if these assumptions are violoated slightly, you can still do the regression, but have to use caution if predicting \u2018y\u2019 from \u2018x\u2019\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<strong>Run<\/strong> <strong>the Regression analysis:<\/strong>\r\n<ul>\r\n \t<li>Select <strong>Stat<\/strong>from the property bar, then choose <strong>Regression <\/strong>and <strong>Regression <\/strong>and <strong>fit regression model<\/strong><\/li>\r\n \t<li>In the regression menu, put the dependent variable in the response box, and the independent variable in the predictor box<\/li>\r\n<\/ul>\r\n<strong>To test the assumptions while doing the test, <\/strong><strong>click<\/strong> <strong>on<\/strong> <strong>the \u201cGraphs\u201d box.<\/strong>\r\n<div>\r\n<ul>\r\n \t<li>In the graphs window, check the box beside <strong>Regular<\/strong>\r\n<ul>\r\n \t<li>This gives the actual values for the residuals<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n[\u2018Standardized\u2019 will convert the values based on their standard deviations so you can see them as a function of their variability. This can be useful in identifying outliers, but otherwise, the graph will look very similar]\r\n<ul>\r\n \t<li>For \u201c<strong>residual<\/strong> <strong>plots<\/strong>\u201d, check the boxes beside normal probability plot, and the residuals vs the \u201cfits\u201d (the independent var).\r\n<ul>\r\n \t<li>If you think that there may be a bias in the order the data are collected, then also plot the residuals vs the order (e.g. if you collect data over a period of time, so you might get a different response later in the day than earlier, that could be a bias)<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div>\r\n<h6>Interpretation of Regression Assumptions:<\/h6>\r\n<strong>Distribution of residuals along the line:<\/strong>\r\n\r\n<img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-42.png\" alt=\"\" width=\"717\" height=\"485\" class=\"alignnone size-full wp-image-434\" \/>\r\n<ul>\r\n \t<li>Here, you are looking for evidence that points are scattered fairly evenly along the horizontal line. This means your \u201cvariance\u201d (variability in the observed values compared to the predicted line) is spread equally along the line... analogous to the \u201cequal variance\u201d assumption in ANOVA.<\/li>\r\n \t<li>The distribution of points above and below the line isn\u2019t perfect, but is fairly uniform if you don\u2019t count the outlier point in the upper right corner.<\/li>\r\n \t<li>This distribution is close enough to pass the assumption test.<\/li>\r\n<\/ul>\r\n<strong>Normality of residuals:<\/strong>\r\n\r\n<strong> <img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-43.png\" alt=\"\" width=\"725\" height=\"486\" class=\"alignnone size-full wp-image-435\" \/><\/strong>\r\n\r\nThis graph is just like the other normality plots we look at to see if data are normal. The computer will automatically determine the residuals for you, and plot them on the normality plot.\r\n\r\n<strong>Interpretation:<\/strong>\r\n<ul>\r\n \t<li>The values are mostly on the line, again with one outlier, so conclude that they are normal<\/li>\r\n \t<li>[Just to be sure, I did a normality test on the data, and it confirm that they are normal (p&gt;0.05).]<\/li>\r\n<\/ul>\r\n<strong>The output from the Regression analysis:<\/strong>\r\n\r\n<strong> <img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-45.png\" alt=\"\" width=\"518\" height=\"676\" class=\"alignnone size-full wp-image-436\" \/><\/strong>\r\n\r\nRegression Analysis Interpretation:\r\n\r\n<\/div>\r\n<div>\r\n<ol>\r\n \t<li>Equation of the line. This is in the form of a straight line, y = mx + b<\/li>\r\n \t<li>The r2 value tells us the proportion of the variability in weight that can be \u201cexplained\u201d (or is statistically related to) by height, so 62% of the weight variation is mathematically related to height. 38% is related to other, unmeasured factors.<\/li>\r\n \t<li>The p-value tests whether the slope of the line is significantly different from zero (if there is too much variability in the points, you can\u2019t be sure that you have a relationship).<\/li>\r\n<\/ol>\r\n<strong>The assumptions were satisfied, so we can generally trust our result:<\/strong>\r\n<ul>\r\n \t<li>There is a significant positive relationship between height and weight.<\/li>\r\n \t<li>The assumption testing tells us we have one outlier value (we could investigate that more closely if we wanted), and that there is a bit more variation at higher levels of \u2018x\u2019 (height) than at lower levels. Therefore, we know that our line does not predict as well for tall people as for short people.<\/li>\r\n \t<li>Although there is a significant relationship, there is still quite a bit of unexplained variation, so height is not the only factor that affects weight of the students.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div><img src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-46.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-437\" \/><\/div>\r\n<div>\r\n\r\nFigure 1. Relationship between height and weight for a group of university students taking part in an exercise on how running affects pulse rates.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>\r\n<div>\r\n\r\n<strong>How do you report results of your regression analysis?<\/strong>\r\n\r\nYour trend statement must give the pattern from your graph, then summarize the statistical results (remember to always plot your data before doing analysis, and include a figure legend). An example from the figure above might read:\r\n\r\nThere was a strong positive relationship between height and weight for the university students taking part in the running exercise, so that as height increased, so did weight (Figure 1, Regression analysis, r2 = 0.617). About 62% of the variation in weight was statistically explained by the variation in height, but the calculated regression line had better predictive ability at lower levels of height than for higher levels of height (Figure 1).\r\n\r\n<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div>\r\n\r\n<strong>Pulses Dataset used for analyses in this manual<\/strong>.\r\n\r\nA group of students were separated into 2 groups. One group ran on the spot for one minute, and the other group did not. Following the trial, information was gathered on whether they smoked, what their sex was, what their height and weight were, and what their normal activity levels were.\r\n<ul>\r\n \t<li>Pulse1 = resting pulse for all.<\/li>\r\n \t<li>Pulse2 = pulse following the running. (Beats per minute)<\/li>\r\n \t<li>Weight is in pounds, Height is in inches<\/li>\r\n \t<li>ran = 1 means they ran,<\/li>\r\n \t<li>ran = 2 means they did not.<\/li>\r\n \t<li>smokes = 1 means they smoked.<\/li>\r\n \t<li>sex = 1 is male<\/li>\r\n \t<li>sex = 2 is female<\/li>\r\n \t<li>activlev is their normal activity level; 1 is slight, 2 is moderate, and 3 is high.<\/li>\r\n<\/ul>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>pulse1<\/td>\r\n<td>pulse2<\/td>\r\n<td>ran<\/td>\r\n<td>smokes<\/td>\r\n<td>sex<\/td>\r\n<td>height<\/td>\r\n<td>weight<\/td>\r\n<td>activ.level<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>&nbsp;\r\n\r\n64<\/td>\r\n<td>&nbsp;\r\n\r\n78<\/td>\r\n<td>&nbsp;\r\n\r\n1<\/td>\r\n<td>&nbsp;\r\n\r\n2<\/td>\r\n<td>&nbsp;\r\n\r\n1<\/td>\r\n<td>&nbsp;\r\n\r\n66<\/td>\r\n<td>&nbsp;\r\n\r\n140<\/td>\r\n<td>&nbsp;\r\n\r\n2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>58<\/td>\r\n<td>75<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>72<\/td>\r\n<td>145<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>62<\/td>\r\n<td>82<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>73.5<\/td>\r\n<td>160<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>66<\/td>\r\n<td>85<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>73<\/td>\r\n<td>190<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>64<\/td>\r\n<td>82<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>69<\/td>\r\n<td>155<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>74<\/td>\r\n<td>84<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>73<\/td>\r\n<td>165<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>84<\/td>\r\n<td>84<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>72<\/td>\r\n<td>150<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>72<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>74<\/td>\r\n<td>190<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>62<\/td>\r\n<td>75<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>72<\/td>\r\n<td>195<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>76<\/td>\r\n<td>88<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>71<\/td>\r\n<td>138<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>80<\/td>\r\n<td>104<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>74<\/td>\r\n<td>160<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>80<\/td>\r\n<td>96<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>72<\/td>\r\n<td>155<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>72<\/td>\r\n<td>88<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>70<\/td>\r\n<td>153<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>76<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>67<\/td>\r\n<td>145<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>60<\/td>\r\n<td>76<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>71<\/td>\r\n<td>170<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>62<\/td>\r\n<td>68<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>72<\/td>\r\n<td>175<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>66<\/td>\r\n<td>88<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>69<\/td>\r\n<td>175<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>70<\/td>\r\n<td>86<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>73<\/td>\r\n<td>170<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>80<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>74<\/td>\r\n<td>180<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>72<\/td>\r\n<td>80<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>66<\/td>\r\n<td>135<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>70<\/td>\r\n<td>106<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>71<\/td>\r\n<td>170<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>74<\/td>\r\n<td>76<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>70<\/td>\r\n<td>157<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>66<\/td>\r\n<td>102<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>70<\/td>\r\n<td>130<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>70<\/td>\r\n<td>98<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>75<\/td>\r\n<td>185<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>96<\/td>\r\n<td>140<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>61<\/td>\r\n<td>140<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>62<\/td>\r\n<td>100<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>66<\/td>\r\n<td>120<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>78<\/td>\r\n<td>104<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>68<\/td>\r\n<td>130<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>82<\/td>\r\n<td>100<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>68<\/td>\r\n<td>138<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>88<\/td>\r\n<td>115<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>63<\/td>\r\n<td>121<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>112<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>70<\/td>\r\n<td>125<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>96<\/td>\r\n<td>116<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>68<\/td>\r\n<td>116<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>78<\/td>\r\n<td>118<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>69<\/td>\r\n<td>145<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>88<\/td>\r\n<td>110<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>69<\/td>\r\n<td>150<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>62<\/td>\r\n<td>98<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>62.75<\/td>\r\n<td>112<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>80<\/td>\r\n<td>128<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>68<\/td>\r\n<td>125<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>62<\/td>\r\n<td>62<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>74<\/td>\r\n<td>190<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>60<\/td>\r\n<td>62<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>71<\/td>\r\n<td>155<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>72<\/td>\r\n<td>70<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>69<\/td>\r\n<td>170<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>62<\/td>\r\n<td>66<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>70<\/td>\r\n<td>155<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>76<\/td>\r\n<td>76<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>72<\/td>\r\n<td>215<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>pulse1<\/td>\r\n<td>pulse2<\/td>\r\n<td>ran<\/td>\r\n<td>smokes<\/td>\r\n<td>sex<\/td>\r\n<td>height<\/td>\r\n<td>weight<\/td>\r\n<td>activ.level<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>68<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>67<\/td>\r\n<td>150<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>54<\/td>\r\n<td>56<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>69<\/td>\r\n<td>145<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>74<\/td>\r\n<td>70<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>73<\/td>\r\n<td>155<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>74<\/td>\r\n<td>70<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>73<\/td>\r\n<td>155<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>68<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>71<\/td>\r\n<td>150<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>72<\/td>\r\n<td>73<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>68<\/td>\r\n<td>155<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>64<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>69.5<\/td>\r\n<td>150<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>82<\/td>\r\n<td>83<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>73<\/td>\r\n<td>180<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>64<\/td>\r\n<td>62<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>75<\/td>\r\n<td>160<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>58<\/td>\r\n<td>58<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>66<\/td>\r\n<td>135<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>54<\/td>\r\n<td>50<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>69<\/td>\r\n<td>160<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>70<\/td>\r\n<td>71<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>66<\/td>\r\n<td>130<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>62<\/td>\r\n<td>61<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>73<\/td>\r\n<td>155<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>76<\/td>\r\n<td>76<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>74<\/td>\r\n<td>148<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>88<\/td>\r\n<td>84<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>73.5<\/td>\r\n<td>155<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>70<\/td>\r\n<td>70<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>70<\/td>\r\n<td>150<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>90<\/td>\r\n<td>89<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>67<\/td>\r\n<td>140<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>78<\/td>\r\n<td>76<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>72<\/td>\r\n<td>180<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>70<\/td>\r\n<td>71<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>75<\/td>\r\n<td>190<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>90<\/td>\r\n<td>90<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>68<\/td>\r\n<td>145<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>92<\/td>\r\n<td>94<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>69<\/td>\r\n<td>150<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>60<\/td>\r\n<td>63<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>1<\/td>\r\n<td>71.5<\/td>\r\n<td>164<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>72<\/td>\r\n<td>70<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>71<\/td>\r\n<td>140<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>68<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>72<\/td>\r\n<td>142<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>84<\/td>\r\n<td>84<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>69<\/td>\r\n<td>136<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>74<\/td>\r\n<td>76<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>67<\/td>\r\n<td>123<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>66<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>68<\/td>\r\n<td>155<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>84<\/td>\r\n<td>84<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>66<\/td>\r\n<td>130<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>61<\/td>\r\n<td>70<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>65.5<\/td>\r\n<td>120<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>64<\/td>\r\n<td>60<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>66<\/td>\r\n<td>130<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>94<\/td>\r\n<td>92<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>62<\/td>\r\n<td>131<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>60<\/td>\r\n<td>66<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>62<\/td>\r\n<td>120<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>72<\/td>\r\n<td>70<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>63<\/td>\r\n<td>118<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>58<\/td>\r\n<td>56<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>67<\/td>\r\n<td>125<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>88<\/td>\r\n<td>74<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>65<\/td>\r\n<td>135<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>66<\/td>\r\n<td>72<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>66<\/td>\r\n<td>125<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>84<\/td>\r\n<td>80<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>65<\/td>\r\n<td>118<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>62<\/td>\r\n<td>66<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>65<\/td>\r\n<td>122<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>66<\/td>\r\n<td>76<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>65<\/td>\r\n<td>115<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>80<\/td>\r\n<td>74<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>64<\/td>\r\n<td>102<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>78<\/td>\r\n<td>78<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>67<\/td>\r\n<td>115<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>68<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>69<\/td>\r\n<td>150<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>72<\/td>\r\n<td>68<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>68<\/td>\r\n<td>110<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>82<\/td>\r\n<td>80<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>63<\/td>\r\n<td>116<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>76<\/td>\r\n<td>76<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>62<\/td>\r\n<td>108<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>87<\/td>\r\n<td>84<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>63<\/td>\r\n<td>95<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>90<\/td>\r\n<td>92<\/td>\r\n<td>2<\/td>\r\n<td>1<\/td>\r\n<td>2<\/td>\r\n<td>64<\/td>\r\n<td>125<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>78<\/td>\r\n<td>80<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>68<\/td>\r\n<td>133<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>68<\/td>\r\n<td>68<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>62<\/td>\r\n<td>110<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>86<\/td>\r\n<td>84<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>67<\/td>\r\n<td>150<\/td>\r\n<td>3<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>76<\/td>\r\n<td>76<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>2<\/td>\r\n<td>61.75<\/td>\r\n<td>108<\/td>\r\n<td>2<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>","rendered":"<div>\n<h1>Part 4: Basic Applied Statistics using Minitab 21<\/h1>\n<\/div>\n<div>\n<p>A note about statistical significance:<\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>All statistical tests are designed to test whether a pattern you see in your data is \u201cstatistically significant\u201d<\/strong>. We say something is statistically significant if our test confirms that the pattern is unlikely to have occurred by chance. To test this we look at the \u201cp-value\u201d<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>The p-value is related to the hypotheses about the data:<\/strong><\/p>\n<ul>\n<li><strong>H<\/strong><strong>0<\/strong><strong>\u00a0<\/strong>(Null Hypothesis) is that there is no difference between groups, or no relationship between variables. This is a \u201cno effect\u201d hypothesis<\/li>\n<li><strong>H<\/strong><strong>A<\/strong><strong>\u00a0<\/strong>(Alternate Hypothesis)\u00a0\u00a0 is that there is a difference or a relationship. This is sometimes called the \u201cactive\u201d hypothesis, since it indicates some sort of effect<\/li>\n<\/ul>\n<p><strong>Therefore, to interpret your data, you need to examine the graph of the data and clearly state an hypothesis, or you won\u2019t know what the p-value means!<\/strong><\/p>\n<table style=\"height: 66px;width: 391px; width: 374px;\">\n<tbody>\n<tr>\n<td style=\"width: 674.533px\">\n<div>\n<p><strong>If p&lt;0.05, reject your Null Hypothesis<\/strong><\/p>\n<p><strong>If p&gt;0.05, accept your Null Hypothesis<\/strong><\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>For each of the tests detailed in the next pages, note what the<\/strong> <strong>Null Hypothesis is, so that you can determine how to interpret<\/strong> <strong>the p-value from the test.<\/strong><\/p>\n<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-1.png\" alt=\"\" width=\"341\" height=\"831\" class=\"alignnone size-full wp-image-383\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-1.png 341w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-1-123x300.png 123w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-1-65x158.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-1-225x548.png 225w\" sizes=\"auto, (max-width: 341px) 100vw, 341px\" \/><\/p>\n<div>\n<h2>Getting started<strong>\u00a0<\/strong><\/h2>\n<p><strong>DATA<\/strong><strong> ENTRY<\/strong><\/p>\n<p>When working in a spreadsheet, the common method of entering your data is in adjoining columns. For example, if you have data such as we looked at in class on the crab temperatures, you would put your crab data for time 1 in one column, and your crab data for time 2 in the next column. However, for advanced stats, you must enter your data so that all the responses for a single variable (in this case, temperature) are in a single column, with another column giving the key for the variable.<\/p>\n<p>Note that data in the \u201cadvanced stats\u201d format are set up so that the responses are given in the columns, and the variables (e.g. whether they ran or not, what their sex was\/is given as a category number in another column. This method of data entry is necessary for most statistical packages.<\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Notes<\/strong><strong> about the data examples<\/strong><\/p>\n<p>For this manual, most of the examples will be from a dataset on an imaginary group of students that were asked to take a bunch of measurements on themselves before and after running in place for one minute.<\/p>\n<p>One group of students was asked to run in place for a minute, and another group (the control) did not run.<\/p>\n<p>Students (in both groups) were asked to take their pulses (in heartbeats per minute) before and after the running exercise, then were asked to indicate whether they were male or female, whether they smoked or not, whether they thought of themselves as active or not, and so on. This data set allows us to illustrate a wide variety of statistical analyses. The dataset is at the back of this manual, and will be placed on your Moodle site, so that you can practice the exercises in this manual, and see what the answers should look like.<\/p>\n<ul>\n<li>Ran = 1 means they ran,<\/li>\n<li>Ran = 2 are the nonrunners (control)<\/li>\n<li>Smokes = 1 means they smoke<\/li>\n<li>Sex = 1 means male<\/li>\n<li>Sex = 2 means female<\/li>\n<li>Activity levels: 1 = slight, 2 = moderate, 3 = high<\/li>\n<\/ul>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div>\n<p>To start in Minitab:<\/p>\n<ul>\n<li>Open Minitab 21\n<ul>\n<li>You will see a \u201csession\u201d window on top where you\u2019ll find the record of what you\u2019ve done, as well as the text results of statistical tests.<\/li>\n<li>The worksheet on the bottom will contain your data.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><strong>Navigate in Minitab through the menus on the upper property toolbar.<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-2.png\" alt=\"\" width=\"1300\" height=\"1028\" class=\"alignnone size-full wp-image-384\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-2.png 1300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-2-300x237.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-2-1024x810.png 1024w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-2-768x607.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-2-65x51.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-2-225x178.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-2-350x277.png 350w\" sizes=\"auto, (max-width: 1300px) 100vw, 1300px\" \/><\/p>\n<p><strong>Entering data:<\/strong><\/p>\n<p>Data can be entered (typed in) directly, or copied from a spreadsheet<\/p>\n<ul>\n<li>From spreadsheet: open your spreadsheet to the desired database and select and copy the data. <strong>O<\/strong><strong>nly copy the numbers<\/strong>&#8230; do not copy column headers (that will designate your columns as text columns, and cause problems when doing the statistical analyses)<\/li>\n<li>Reenter Minitab, paste the data into the worksheet.<\/li>\n<li>Name your columns by clicking on the blank space below the column number, and typing in the name<\/li>\n<li>Save your worksheet onto your data disk or personal drive. It will save as a .mpx file<\/li>\n<\/ul>\n<\/div>\n<div>\n<p>Simple Column Statistics<strong>\u00a0<\/strong><\/p>\n<p><strong>Descriptive statistics:<\/strong><\/p>\n<ul>\n<li>Select <strong>Stat <\/strong>from the property bar, and click on \u201cbasic statistics\u201d.<\/li>\n<li>Select \u201c<strong>display descriptive statistics<\/strong>\u201d to see the Display window<\/li>\n<li>Click on the box labelled \u201cStatistics\u201d to see the range of statistics available.<\/li>\n<li>The list of variables is on the left and an empty box for the variable(s) to test is on the right.<\/li>\n<\/ul>\n<p><strong>Highlight the variable you want to test, and click select<\/strong><\/p>\n<p>There is a long list of potential statistics you can have the computer calculate, all in one operation. Check the boxes of all you would like.<\/p>\n<ul>\n<li>Click <strong>ok <\/strong>to return to the display descriptive stats window<\/li>\n<li>Click <strong>ok <\/strong>again to obtain the data.<\/li>\n<\/ul>\n<p>Your results will appear in the upper Minitab Session Window. You can copy and paste the results into your word processor spreadsheet if you want to.<\/p>\n<\/div>\n<div>\n<p><strong>Variables<\/strong> <strong>to<\/strong> <strong>choose:<\/strong> Variables must be ordinal (such as Pulse 1, Pulse 2, Weight, Height in this example; Note that your categorical variables won\u2019t work here.<\/p>\n<p>This section also allows you to see some basic graphs for your variables, by clicking on \u201cgraphs\u201d rather than\u201cstatistics\u201d<\/p>\n<ul>\n<li>Click on all of these options to see how these graphs can help you get a feel for what your data looks like.<\/li>\n<\/ul>\n<\/div>\n<div>\n<h3>Descriptive statistics for sub-groups within each column of data (e.g. males and females in your group; age groups, etc.)<\/h3>\n<p>Options:<\/p>\n<p>a)\u00a0 cut and paste in your spreadsheet, then copy into Minitab and run analysis twice<\/p>\n<p>b)\u00a0 <strong>Split the data <\/strong>in Minitab (see page 46 for method) and run analysis twice<\/p>\n<p>c)\u00a0 use the \u201c<strong>By variables<\/strong><strong>\u201d<\/strong> option and run the analyses simultaneously.<\/p>\n<p>e.g. In the pulses dataset, each of the two pulse columns (Pulse 1, Pulse 2) include groups (runners &amp; non-runners, males and females, smokers and non-smokers). If you want to compare one subgroup to the other within a single column of data, you will need descriptive statistics and normality testing on each subgroup.<\/p>\n<p><strong>Example: <\/strong>calculate the descriptive statistics for the runners and the non-runners separately, in the pulses2 column (the second pulse rate, measured after running).<\/p>\n<ul>\n<li>Go into <strong>stat <\/strong>on the property bar, and click on \u201c<strong>basic statistics<\/strong>\u201d, then <strong>\u201cdisplay descriptive statistics<\/strong>\u201d. Select the Pulse2 column, and click select. Then place your cursor in the <strong>By variables <\/strong>window, and select the group variable, Ran<\/li>\n<\/ul>\n<p>Your output (in the Session box) will have information for both subgroups in your variable (i.e. Ran 1 &amp; 2), and your graphs will also show both subgroups.<\/p>\n<p>You can see how this lets you look at multiple subgroups separately without having to do a lot of cutting and pasting. you can choose this option in your \u201cstore descriptive statistics\u201d section as well.<\/p>\n<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-3.png\" alt=\"\" width=\"1339\" height=\"441\" class=\"alignnone size-full wp-image-388\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-3.png 1339w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-3-300x99.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-3-1024x337.png 1024w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-3-768x253.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-3-65x21.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-3-225x74.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-3-350x115.png 350w\" sizes=\"auto, (max-width: 1339px) 100vw, 1339px\" \/><\/p>\n<div>\n<h3>Normality Testing<\/h3>\n<p>Many statistical tests depend on data being parametric (one part of which is normality). Normality testing is a first step for most statistical testing.<\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p>The normal distribution is a type of frequency distribution which has a characteristic bell curve shape with a particular height and width for its mean (average) and Standard Deviation. We can assess normality (by eye) by plotting the frequency distribution of the data, and comparing it to the normal curve that is calculated for a data set with this mean and standard deviation. However, it can be hard to see from the frequency plot so there are other methods to assess normality.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Several methods to test normality: Generally use at least two, since not all work well for all data<\/strong><\/p>\n<p>a.\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 <strong><em>frequency histogram<\/em><\/strong><strong>: <\/strong>important to see what data look like, but not very accurate for assessing whether they fit the normal distribution<\/p>\n<p>b.\u00a0\u00a0\u00a0\u00a0\u00a0 <strong><em>normal probability plot<\/em><\/strong><strong>:<\/strong> This modifies the frequency scale, so that if data are normal, they fall on a straight line. <strong><em>**This is usually the best method to assess normality<\/em><\/strong><\/p>\n<p>c.\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 <strong><em>determining whether the shape of the curve fits a mathematical range <\/em><\/strong><strong>(based on \u201cskew\u201d (tails) and \u201ckurtosis\u201d (height of curve)<\/strong>). This is a great method to do as a check on the other methods, in case you just aren\u2019t sure of the interpretation.<\/p>\n<p>d.\u00a0\u00a0\u00a0\u00a0\u00a0 <strong><em>Statistical methods: <\/em><\/strong>These give some comfort because they seem quantitative, but in fact, they are not accurate in many cases so should be used with caution. Several do not work well with small sample sizes, and several don\u2019t work well if there are many \u201ctied values\u201d (the same number repeated frequently in the dataset)<\/p>\n<h3>Frequency Distributions<\/h3>\n<p>First, assess the frequency distribution as a first step to see what the data look like.<\/p>\n<ul>\n<li>Plot the histogram through the <strong>Column Statistics <\/strong>menu as described on p. 83, or through <strong>Graph, Histogram, <\/strong>as described on pp. 73-75.<\/li>\n<li>In the graphing menu, choose <strong>\u201cwith fit\u201d <\/strong>for the histogram with the normal curve.<\/li>\n<li>Select your variables as before, then click on \u201c<strong>dataview<\/strong>\u201d. Choose the \u201cdistribution tab, and make sure your distribution says \u201cnormal\u201d and click ok<\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-4.png\" alt=\"\" width=\"907\" height=\"608\" class=\"alignnone size-full wp-image-389\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-4.png 907w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-4-300x201.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-4-768x515.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-4-65x44.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-4-225x151.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-4-350x235.png 350w\" sizes=\"auto, (max-width: 907px) 100vw, 907px\" \/><\/p>\n<\/div>\n<div>\n<p><strong>Does your graph match the bell?<\/strong><\/p>\n<h3>The normal probability plot<\/h3>\n<ul>\n<li>Select <strong>Graph <\/strong>from the property bar, and then choose Probability plot from the drop-down menu. When prompted, choose the \u201csingle\u201d graph, and click okay<\/li>\n<li>Choose your variable as before. The normal probability plot is the default, but if you want to test another distribution (e.g. random), you can click on \u201cdistribution\u201d.<\/li>\n<\/ul>\n<p>This is produces a plot of your frequency histogram on a special probability scale, so that if the data are normal, your points should fall on a straight line. (Remember that this is for the entire column of data; if you need a subset of the data, such as males vs females, you\u2019ll need to separate them)<\/p>\n<p>You can check this by eye, or you can do a statistical normality test on the data.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-5.png\" alt=\"\" width=\"911\" height=\"607\" class=\"alignnone size-full wp-image-390\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-5.png 911w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-5-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-5-768x512.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-5-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-5-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-5-350x233.png 350w\" sizes=\"auto, (max-width: 911px) 100vw, 911px\" \/><\/p>\n<\/div>\n<p>This plot includes the 95% confidence limit for the line. If the points generally fall along the line and are within the confidence limits lines, then you can assume normality<\/p>\n<div><\/div>\n<p><strong>The conclusion from this <\/strong><strong>graph is that the data are normal. <\/strong>The dots deviate from the line very slightly, but not by much, and most fall within the confidence limits. We can run through other methods to see if they confirm our impression from the graph.<\/p>\n<div>\n<h3>Using skew and kurtosis calculations<\/h3>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-6.png\" alt=\"\" width=\"1344\" height=\"427\" class=\"alignnone size-full wp-image-391\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-6.png 1344w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-6-300x95.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-6-1024x325.png 1024w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-6-768x244.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-6-65x21.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-6-225x71.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-6-350x111.png 350w\" sizes=\"auto, (max-width: 1344px) 100vw, 1344px\" \/><strong>Data are normally distributed if the standard error of the skew (SE<\/strong><strong>skew<\/strong><strong>)<\/strong> <strong>and the standard error of the kurtosis (SE<\/strong><strong>kurtosis<\/strong><strong>)<\/strong> <strong>fall between -1.96 and +1.96.<\/strong><\/p>\n<p><strong>Method: Determine the Standard Error (SE) of the kurtosis and skew from the skew and kurtosis values in the Descriptive stats analysis (see p. 84) <\/strong>(note that some statistical packages do this calculation for you). The equations below provide an approximation of the SE values.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-7.png\" alt=\"\" width=\"794\" height=\"122\" class=\"alignnone size-full wp-image-392\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-7.png 794w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-7-300x46.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-7-768x118.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-7-65x10.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-7-225x35.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-7-350x54.png 350w\" sizes=\"auto, (max-width: 794px) 100vw, 794px\" \/><\/p>\n<ul>\n<li>SEskew\u00a0= Skew \u00f7 \u221a (6\/n)<\/li>\n<li>SEkurtosis\u00a0= kurtosis \u00f7 \u221a (24\/n)<\/li>\n<li>n=no. of obs.<\/li>\n<\/ul>\n<p>For the Pulse 1 column:<\/p>\n<ul>\n<li>SEskew\u00a0= skew \u00f7 \u221a(6\/n)\u00a0\u00a0 = .43 \u00f7 \u221a (6\/91) = 1.67<\/li>\n<li>SEkurt = kurtosis \u00f7 \u221a (24\/n\u00a0 = -.58 \u00f7 \u221a (24\/91) = -1.13<\/li>\n<\/ul>\n<p><strong>These both fall between 1.96 and -1.96, so data are statistically normal<\/strong><\/p>\n<p><strong>Note: the SE skew value is close to non-normal, as you can see by the bars in the figure looking a bit crowded towards the left side, but it still falls in the statistical range.<\/strong><\/p>\n<p><strong>\u00a0<\/strong><strong>This test confirms that the Pulse 1 data are normally distributed<\/strong><\/p>\n<ul>\n<li><strong>\u00a0Note: \u201cData is\u201d ??\u00a0\u00a0 \u201cData are\u201d ?? The word \u201cdata\u201d is plural (the singular version is \u201cdatum\u201d) so should always be given with the plural form of the verb when written.<br \/>\n<\/strong><\/li>\n<\/ul>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Using Statistical Normality tests:<\/strong><\/p>\n<p>One big problem with statistical normality tests is that they are adversely affected by both small sample sizes and large numbers of \u201ctied values\u201d, i.e. when we have a number of duplicate values in our list of numbers. Tied values often occur if we have a large data set. Therefore the normality test must always be treated with caution, and results should ALWAYS be checked against the normal probability curve.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3><strong>\u00a0<\/strong>Using statistical normality tests<\/h3>\n<\/div>\n<div>\n<ul>\n<li>Minitab provides 3 statistical normality tests. All three will plot the \u201cnormal probability curve\u201d as part of the analysis so you can compare the test result to the graph.<\/li>\n<li>The purpose of the tests is to see whether the dots are significantly different from the line.<\/li>\n<\/ul>\n<p><strong>Method:<\/strong><\/p>\n<ul>\n<li>Choose <strong>Stat<\/strong>, Basic Statistics, then <strong>Normality Test <\/strong>(near the bottom of the drop down menu).<\/li>\n<li>In the Normality Test window, select your variable, choose your test, and click ok.<\/li>\n<\/ul>\n<p><strong><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-8.png\" alt=\"\" width=\"572\" height=\"589\" class=\"alignnone size-full wp-image-393\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-8.png 572w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-8-291x300.png 291w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-8-65x67.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-8-225x232.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-8-350x360.png 350w\" sizes=\"auto, (max-width: 572px) 100vw, 572px\" \/><\/strong><\/p>\n<p><strong>Which test to choose?<\/strong><\/p>\n<ul>\n<li><strong>Anderson Darling<\/strong>: quite strongly affected by tied values, which are often encountered in large sample sizes. If you pick the AD test, then make sure you view the probability plot to see if there are tied values.<\/li>\n<li><strong>Ryan<\/strong> <strong>Joiner (Shapiro Wilk<\/strong>): this test is useful for large sample sizes (&gt;) as it doesn\u2019t react as strongly to tied values<\/li>\n<li><strong>The Kolmogorov-Smirnov test<\/strong>: Avoid this test unless it is \/illefors corrected: It is not very powerful and will often say data are normal when they are not. If you use it, use a p-value cut-off of 0.10 rather than 0.05<\/li>\n<\/ul>\n<\/div>\n<div>\n<p><strong>The null hypothesis is that data are normal, so:<\/strong><\/p>\n<ul>\n<li><strong>p &gt; 0.05, data are normal.<\/strong><\/li>\n<li><strong>p &lt; 0.05, data are signif. diff. from normal.<\/strong><\/li>\n<\/ul>\n<p>The output is a probability graph (but without the confidence lines to help interpret) and the results of the statistical test in the small box at top right.<\/p>\n<p>Look for the <strong>P-Value <\/strong>to determine if data are normal. If p&lt;0.05, it is non-normal.<\/p>\n<p>Other information provided: mean and SD values as well as the total \u2018N\u2019 and the test value from the statistical test (in this case, AD for the Anderson Darling). <strong>Do not <\/strong>confuse the test statistic with the P-Value. (If you pick the Ryan Joiner test, it will give the RJ value, and KS for Kolgomorov-Smirnov)<\/p>\n<\/div>\n<p>&nbsp;<\/p>\n<p><strong>Interpreting P-Value Results from all three statistical normality tests:<\/strong><\/p>\n<div>\n<table>\n<tbody>\n<tr>\n<td>Anderson Darling<\/td>\n<td>Ryan- Joiner<\/td>\n<td>Kolmogorov<\/p>\n<p>-Smirnov<\/td>\n<td>Note the differences in result here. Recall that the AD test is badly affected by tied values, and it will usually say data are non-normal when they are actually normal if tied values are present. The Ryan Joiner test is better for tied values, and indicates that data are right on the edge of normal (which is similar to what we saw with the skew\/kurtosis calculation). The K-S test can be used as long as the cut-off is 0.10 rather than 0.05, however these data appear non-normal.<\/td>\n<\/tr>\n<tr>\n<td>p = 0.014<\/td>\n<td>&gt;0.100<\/td>\n<td>0.016<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>To<\/strong> <strong>assess<\/strong> <strong>normality,<\/strong> <strong>always<\/strong> <strong>use<\/strong> <strong>more<\/strong> <strong>than<\/strong> <strong>one<\/strong> <strong>method:<\/strong><\/p>\n<p>For data to be normal, the histogram should look like a bell curve, the normal probability plot should have points falling close to the line, the SE of the skew and kurtosis should fall between -1.96 and +1.96, and the statistical tests should have a p value greater than 0.05 (or 0.10 for the K-S test). How did we do?<\/p>\n<p><span style=\"text-decoration: underline\"><strong>Interpretation:<\/strong><\/span><\/p>\n<table style=\"height: 150px;width: 648px; width: 667px;\">\n<tbody>\n<tr style=\"height: 60px\">\n<td style=\"width: 510.45px;height: 60px\"><\/td>\n<td style=\"width: 10.0167px;height: 60px\"><strong>\u00a0<\/strong><\/p>\n<p>&nbsp;<\/td>\n<td style=\"width: 128.133px;height: 60px\"><strong>\u00a0<\/strong><\/p>\n<p><span style=\"text-decoration: underline\">Conclusion<\/span><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 510.45px;height: 15px\">Histogram: looks a bit skewed, but not too far off<\/td>\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\n<td style=\"width: 128.133px;height: 15px\">Normal<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 510.45px;height: 15px\">Normal Prob.Plot: the dots fall within the 95% conf.<\/td>\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\n<td style=\"width: 128.133px;height: 15px\">Normal<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 510.45px;height: 15px\">Skew &amp; Kurtosis: fall within the range<\/td>\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\n<td style=\"width: 128.133px;height: 15px\">Normal<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 510.45px;height: 15px\">Anderson-Darling: p = 0.014<\/td>\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\n<td style=\"width: 128.133px;height: 15px\">Non-normal<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 510.45px;height: 15px\">Ryan-Joiner: p &gt; 0.100<\/td>\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\n<td style=\"width: 128.133px;height: 15px\">Normal<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 510.45px;height: 15px\">Kolmogorov-Smironov: p = 0.016<\/td>\n<td style=\"width: 10.0167px;height: 15px\"><\/td>\n<td style=\"width: 128.133px;height: 15px\">Non-normal<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>\u00a0<\/strong>The Anderson Darling and K-S tests give a different result than the others, but since we know it is affected by tied values, we don\u2019t use that one since there are lot of tied values in the plot. The other two indicate that data are normal or very close. <strong>Conclusion: Data are normal<\/strong><\/p>\n<p><strong>\u00a0<\/strong><strong>\u00a0<\/strong><\/p>\n<p><strong>Example using non-normal data<\/strong><\/p>\n<p>For comparison, lets look at some obviously non-normal data: The pulses 2 column:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9.png\" alt=\"\" width=\"1826\" height=\"617\" class=\"alignnone size-full wp-image-394\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9.png 1826w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9-300x101.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9-1024x346.png 1024w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9-768x260.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9-1536x519.png 1536w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9-65x22.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9-225x76.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-9-350x118.png 350w\" sizes=\"auto, (max-width: 1826px) 100vw, 1826px\" \/><\/p>\n<\/div>\n<div>\n<p><strong>Interpretation of the statistical Normality tests:<\/strong><strong>\u00a0<\/strong><\/p>\n<table>\n<tbody>\n<tr>\n<td>Anderson Darling<\/td>\n<td>Ryan Joiner<\/td>\n<td>Kolmogorov- Smironov<\/td>\n<td>This time, all statistical tests indicate that data are non-normal, since the p-values are &lt;0.05 in all cases. Even though there are tied values (so we don\u2019t trust the AD test), the RJ and KS tests are clearly non-normal.<\/td>\n<\/tr>\n<tr>\n<td>p &lt;0.005<\/td>\n<td>p&lt;0.01<\/td>\n<td>P&lt;0.01<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>\u00a0<\/strong><strong>Evaluating the SE of the Skew and Kurtosis:<\/strong><strong>\u00a0<\/strong><\/p>\n<table>\n<tbody>\n<tr>\n<td>Skewness1<\/td>\n<td>Kurtosis1<\/td>\n<td>SEskew\u00a0= skew \u00f7 \/(6\/n)<\/td>\n<td>SEkurt\u00a0 = kurtosis \u00f7 \/(24\/n)<\/td>\n<\/tr>\n<tr>\n<td>1.11<\/td>\n<td>1.49<\/td>\n<td>=1.11 \u00f7 \u221a (6\/91) = 4.32<\/td>\n<td>=1.49 \u00f7 \u221a (24\/91) = 27.67<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>\u00a0Interpretation:<\/strong><\/p>\n<\/div>\n<div>\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Conclusion<\/p>\n<\/div>\n<div>\n<p>Histogram: quite skewed\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal<\/p>\n<p>Normal Prob.Plot: the dots are well off the line\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal<\/p>\n<p>Skew &amp; Kurtosis: fall outside the range\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal<\/p>\n<p>Anderson-Darling: p &lt;&lt; 0.05\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal<\/p>\n<p>Ryan-Joiner: p &lt;&lt; 0.05\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal<\/p>\n<p>Kolmogorov-Smironov: p &lt;&lt; 0.05\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-normal<\/p>\n<p>These data (Pulses 2) are clearly non-normal, as all normality tests confirm this.<\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p><strong>\u00a0Comments on interpreting normality testing:<\/strong><\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<ul>\n<li>Most statistical tests are \u201crobust\u201d to minor violations of normality, so as long as it is close, data can be considered normal for the purposes of doing the statistical tests to find out if groups differ or are related to each other.<\/li>\n<li>It is usually easy to tell for data that are strongly normal or strongly non-normal<\/li>\n<li>For datasets where it seems \u201cclose\u201d, the best tests are to just look at the probability plot (with confidence limits) and to check the skew and kurtosis.<\/li>\n<\/ul>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>\u00a0<\/strong><\/p>\n<h3>Parametric vs Non-parametric testing:<\/h3>\n<ul>\n<li><strong>\u00a0 Parametric tests are better at picking up statistical differences than non- parametric tests, so we prefer to use them when we can<br \/>\n<\/strong><\/li>\n<\/ul>\n<p><strong>Assumptions for parametric testing:<\/strong><\/p>\n<ul>\n<li><strong>Data are normally distributed (or close to normal) \u2013 test normality<\/strong><\/li>\n<li><strong>The variances of the groups are similar to each other \u2013 test \u201cequal variances\u201d<\/strong><\/li>\n<li><strong>Data are independent of each other \u2013\u00a0 study design point<\/strong><\/li>\n<li><strong>Data were collected in a random fashion \u2013 study design point<\/strong><\/li>\n<\/ul>\n<\/div>\n<div>\n<p><strong>\u00a0<\/strong><\/p>\n<h3>Statistical Tests<strong>\u00a0<\/strong><\/h3>\n<p><strong>Which do you use? Here is a key to the basic tests<\/strong><\/p>\n<h3>Dichotomous key to statistical tests<\/h3>\n<p>1a. Are you comparing the averages from groups of data?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;. 2<\/strong><\/p>\n<p>1b. Are you looking for a relationship between two variables<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;. 11<\/strong><\/p>\n<p>2a. Are you comparing the average from one group of numbers to a single predicted value?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;. 3<\/strong><\/p>\n<p>2b. Are you comparing the average from more than one group to averages?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;.. 4<\/strong><\/p>\n<p>3a. Are your data normally distributed&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;..<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;One Sample t-test<br \/>\n<\/strong><\/p>\n<p>3b. Are your data non-normal? &#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;..<strong>One Sample Wilcoxin test<\/strong><\/p>\n<p>4a. Are you comparing the averages of two groups?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;5<\/strong><\/p>\n<p>4b. Are you comparing the averages of three or more groups?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;.8<\/strong><\/p>\n<p>5a. Are your data paired? (i.e. are you measuring something at time a and b on the same individuals?&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;<strong>Paired t-test or non-parametric paired test<br \/>\n<\/strong><\/p>\n<p>5b. Are your data unpaired (i.e. are you just comparing the average values for your groups?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;.6<\/strong><\/p>\n<p>6a. Are your data non-normal?&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;..<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;Mann-Whitney U-test<br \/>\n<\/strong><\/p>\n<p>6b. Are your data normally distributed<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;7<\/strong><\/p>\n<p>7a. Are your data normal with equal variance&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;..<strong>Student\u2019s t-test<\/strong><\/p>\n<p>7b. Are your data normally distributed with unequal variance?&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;..<strong>Students t-test, with variance correction<\/strong><\/p>\n<p>8a. Are you comparing averages of three or more groups without subgroups?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;.9<\/strong><\/p>\n<p>8b. Do your data have a subgroup or factor you want to compare (e.g. response of males and females within different treatment groups)?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;..10<\/strong><\/p>\n<p>9a. Are your data normally distributed with equal variance&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;.<strong>One Way ANOVA<\/strong><\/p>\n<p>9b. Are your data QRQ normal or have unequal variance?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;Kruskall-Wallis Test<\/strong><\/p>\n<p>10a. Are your data normally distributed &#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;<strong>Two-Way (factorial) ANOVA<\/strong><\/p>\n<p>10b. Are your data QRQ normal or have unequal variance?&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;..<strong>No simple non-parametric test<\/strong><\/p>\n<p>11a. Are your data normally distributed, with error distribution normal and heterogeneous?<strong>&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;.Pearson Correlation and Regression<\/strong><\/p>\n<p>11b. Are your data non-normal?&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;<strong>Spearman Correlation<\/strong><\/p>\n<\/div>\n<div>\n<p>&nbsp;<\/p>\n<h3>What are the stats telling us? Comparing Groups or Relationships<\/h3>\n<p>We could be comparing groups or looking for relationships among variables.<\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p>Our first step in doing statistical comparisons should be to plot the data with error bars, to get a visual image of what we are comparing.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h4>Comparing groups<\/h4>\n<p>One of the main types of statistical analyses we do is to compare groups of data. When we do this, we\u2019re really taking the average of the group, and comparing those averages, taking into consideration how variable the data are, and how many samples we have.<\/p>\n<p>&nbsp;<\/p>\n<p>The type of graph we plot depends on the shape of the data (see \u201cFrequency Distributions\u201d, p. 73-75).<\/p>\n<ul>\n<li>If data are normally distributed, use a bar graph with error bars<\/li>\n<li>If data are non-normal, use a box and whisker plot<strong><br \/>\n<\/strong><\/li>\n<\/ul>\n<\/div>\n<div>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10.png\" alt=\"\" width=\"1826\" height=\"617\" class=\"alignnone size-full wp-image-397\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10.png 1826w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10-300x101.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10-1024x346.png 1024w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10-768x260.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10-1536x519.png 1536w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10-65x22.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10-225x76.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-10-350x118.png 350w\" sizes=\"auto, (max-width: 1826px) 100vw, 1826px\" \/><\/p>\n<p>Looking at the relationship between two variables (plot x against y):<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-11.png\" alt=\"\" width=\"901\" height=\"600\" class=\"alignnone size-full wp-image-399\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-11.png 901w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-11-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-11-768x511.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-11-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-11-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-11-350x233.png 350w\" sizes=\"auto, (max-width: 901px) 100vw, 901px\" \/><\/p>\n<ul>\n<li>Again, plot the data first, to see what the actual pattern looks like. Then use statistical analysis to see whether the relationship you see with your eye is statistically significant.<\/li>\n<\/ul>\n<\/div>\n<div>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Remember<\/strong> <strong>to<\/strong> <strong>test<\/strong> <strong>data<\/strong> <strong>for<\/strong> <strong>normality<\/strong> <strong>before<\/strong> <strong>deciding<\/strong> <strong>which test to use. Since you only have one data set here, you do not also need to test equal variance.<\/strong><\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h4>Using statistics to compare groups:<\/h4>\n<p>We use different tests depending on the number of groups, and whether data are parametric.<\/p>\n<h5>Comparing one group to a predicted value:<\/h5>\n<ul>\n<li><strong>Parametric: one-sample t-test (compares the mean)<\/strong><\/li>\n<li><strong>Non-parametric: one-sample Wilcoxin Test (compares the median)<\/strong><strong><br \/>\n<\/strong><\/li>\n<\/ul>\n<p>One sample tests allow us to compare the average and variation in data from an <strong>observed group <\/strong>to a <strong>predicted value<\/strong>.<\/p>\n<ul>\n<li>The Null Hypothesis is that the mean of your observed data is equal to the predicted value<\/li>\n<li>If P&lt;0.05, then the means are significantly different.<\/li>\n<\/ul>\n<p><strong>Parametric Example:<\/strong> <strong><em>One sample t-test <\/em><\/strong><\/p>\n<p><strong>Study Question: Is the average resting pulse equal to 70 beats per minute?<\/strong><\/p>\n<ul>\n<li>H0: Resting pulse (Pulse 1) = 70 beats per minute<\/li>\n<li>HA: Resting pulse (Pulse 1) \u2026 70 beats per minute\n<ul>\n<li>Choose Stat from the property bar, and then Basic Statistics, and then 1 sample t<\/li>\n<li>Select your variable (e.g., <strong>pulse 1<\/strong>)<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><strong>Note<\/strong>; if your variable list is blank, just click on the white space in the \u201csamples in columns<\/p>\n<ul>\n<li>check the box for <strong>\u201cperform hypothesis test<\/strong>\u201d, and type in the value you\u2019re comparing (i.e. 70). Click OK<\/li>\n<li>Note that we have a simple alternate hypothesis here, of \u201cequal\u201d vs \u201cnot equal\u201d. If you want to be more specific and test if it is greater than or less than your hypothesized mean, click on \u201cOptions\u201d and change it<\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13.png\" alt=\"\" width=\"395\" height=\"339\" class=\"alignnone size-full wp-image-401\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13.png 395w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13-300x257.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13-65x56.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13-225x193.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13-350x300.png 350w\" sizes=\"auto, (max-width: 395px) 100vw, 395px\" \/><\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Interpretation:<\/strong><\/p>\n<p>p &lt; 0.05, therefore the mean of the observed resting pulses is significantly different from 70.<\/p>\n<p><strong>Trend: <\/strong>Since the mean (from the output) is 73.14, we can say that pulses are significantly higher than 70 beats\/min.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><strong>Non parametric Example:<\/strong><\/p>\n<p><strong><em>One sample Wilcoxin<\/em><\/strong><\/p>\n<p>Use this test if data are non-normal.<\/p>\n<ul>\n<li>Select <strong>Stat<\/strong>, <strong>Nonparametrics, <\/strong>then<strong>1-sample Wilcoxin<\/strong><\/li>\n<\/ul>\n<p>Set it up the same way as for the 1-sample t-test, but test a median rather than a mean.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13.png\" alt=\"\" width=\"395\" height=\"339\" class=\"alignnone size-full wp-image-401\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13.png 395w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13-300x257.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13-65x56.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13-225x193.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-13-350x300.png 350w\" sizes=\"auto, (max-width: 395px) 100vw, 395px\" \/><\/p>\n<p>Note that this non-parametric test also showed a significant p-value (p=0.028), but that this one is much closer to the 0.05 cutoff than the parametric test, reminding us that this is generally a less powerful test.<\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Always<\/strong> <strong>choose the parametric test if your data are normal.<\/strong><\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div>\n<h5>Comparing 2 Groups<strong>: <\/strong><\/h5>\n<p><em>Study Question: is Pulse 1 different from Pulse 2?<\/em><\/p>\n<p><strong>\u00a0<\/strong><strong>Testing Parametric Assumptions: <\/strong>Before doing this test, remember to test each group of data for normality (p. 87-92). The other assumption for the parametric test is that variances in the groups are similar, so you will also have to do an \u201cequal variance\u201d test. We already know that data for Pulses 1 were normal, but the Pulses 2 data were not. Therefore, parametric test will give an invalid result and the correct test to do here is the non-parametric test. We will test for equal variance on p. 98. Examples with both tests are shown here so you can see how to do them.<\/p>\n<\/div>\n<div>\n<h6>Parametric test: <em>Student\u2019s t-test <\/em>(AKA 2-sample t-test)<\/h6>\n<p><strong>Step<\/strong><strong> 1: <\/strong><strong>Look at plot of data to see pattern<\/strong><\/p>\n<\/div>\n<div>\n<p><strong>Step<\/strong><strong> 2: <\/strong><strong>Setting up the data table<\/strong><\/p>\n<p><strong>Minitab has two methods:<\/strong><\/p>\n<ul>\n<li>The t-test allows us to put the data into adjoining columns(spreadsheet fashion) or to have them in one column. For this example, our Pulse 1 and Pulse 2 data are already in adjoining columns so we\u2019ll pick this option.<\/li>\n<\/ul>\n<p><strong>Step 3: <\/strong><strong>do the test: <\/strong>Choose <strong>2 sample t-test <\/strong>from the Stat\/Basic Statistics drop-down menu.<\/p>\n<p>Note the box here where it says \u201cassume equal variance\u201d &#8230; if you have tested equal variance and they are statistically equal, then check this box.<\/p>\n<p><strong>Output for t-test:<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-15.png\" alt=\"\" width=\"541\" height=\"658\" class=\"alignnone size-full wp-image-402\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-15.png 541w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-15-247x300.png 247w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-15-65x79.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-15-225x274.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-15-350x426.png 350w\" sizes=\"auto, (max-width: 541px) 100vw, 541px\" \/><\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p>Interpretation: p &lt; 0.05. Therefore, If assumptions are verified (normal + equal variance), then we would conclude that Pulse 1 is OHVV WKDQ Pulse 2.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>\u00a0Assumptions:<\/strong><\/p>\n<\/div>\n<div>\n<p>1.\u00a0\u00a0\u00a0 Test Normality: Pulse 2 was not normal (see p. 92)<strong>(this<\/strong><strong> means the t-test result was not valid)<\/strong><\/p>\n<p>2.\u00a0\u00a0\u00a0 Test Equal Variance: do variance test as described below<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Test<\/strong> <strong>for equal variance for t-test:<\/strong><\/p>\n<ul>\n<li>Choose <strong>\u201c2 Variances\u201d <\/strong>from the Stat\/Basic Statistics drop-down menu.<\/li>\n<li>Select your variables that you are comparing (in this case, I chose the samples in dif. columns, but if your data were set up in a single column the result would be the same; see p. 100 for example). <strong>Click ok<\/strong><\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-16.png\" alt=\"\" width=\"708\" height=\"491\" class=\"alignnone size-full wp-image-403\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-16.png 708w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-16-300x208.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-16-65x45.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-16-225x156.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-16-350x243.png 350w\" sizes=\"auto, (max-width: 708px) 100vw, 708px\" \/><\/p>\n<p>Minitab 21 uses Bonnett and Levene Tests and produces some companion graphs. The boxplot shows the range of the data,so we can see that the variation is Pulse 2 is much higher than that for Pulse 1<\/p>\n<p>The interval plot shows us the difference in the standard deviation for the two groups, with confidence limits, so we can see that they don\u2019t overlap.<\/p>\n<p>Note the outputs for the Equal Variance test in the top right corner.<\/p>\n<ul>\n<li>Both show that p &lt; 0.05, <strong><em>so we reject the null hypothesis <\/em><\/strong>that variances are equal<\/li>\n<\/ul>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Important:<\/strong><\/p>\n<ul>\n<li>If the data passed the normality test, but failed the equal variance test, you may still be able to do a t-test<\/li>\n<li>Sample size must be &gt;10<\/li>\n<li>Leave \u201cassume equal variance\u201d box unchecked.<\/li>\n<\/ul>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Therefore, we conclude that variances are different<\/p>\n<h5>Interpretation:<\/h5>\n<p>The data did not pass the assumption tests, therefore any result from the t-test is not valid and cannot be trusted<\/p>\n<p>Therefore, we would discard our t-test result, and carry out a Mann-Whitney U-test.<\/p>\n<\/div>\n<div>\n<h4><strong>Nonparametric test:<\/strong> <strong><em>Mann-Whitney<\/em><\/strong><strong><em> U-test <\/em><\/strong>(Use this test if data are not normal)<\/h4>\n<p><strong>Setting up the data:<\/strong><\/p>\n<ul>\n<li>You must have your data in separate columns for this test.<\/li>\n<li>No assumption testing is needed for this test.<\/li>\n<\/ul>\n<p><strong>Carry out the test: <\/strong>Choose the Mann-Whitney test from the <strong>Nonparametrics <\/strong>drop down menu (from <strong>Basic Statistics<\/strong>)<\/p>\n<ul>\n<li>Select your variables, and click ok<\/li>\n<\/ul>\n<p><strong>This test compares medians (middle values in a list that is ranked from smallest to largest) rather than means. <\/strong><strong>Minitab Output:<\/strong><\/p>\n<p><strong> <img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-17.png\" alt=\"\" width=\"398\" height=\"651\" class=\"alignnone size-full wp-image-404\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-17.png 398w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-17-183x300.png 183w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-17-65x106.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-17-225x368.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-17-350x572.png 350w\" sizes=\"auto, (max-width: 398px) 100vw, 398px\" \/><\/strong><\/p>\n<p><strong>Conclusion: <\/strong><strong>no assumptions are required, and there is a signifcant difference among groups, since p &lt; 0.05<\/strong><\/p>\n<p><strong>Trend statement: <\/strong>The pulse rate of students after running (Pulse 2) was significantly higher than the pulse rate before running (Mann-Whitney U-test, P=0.0048).<\/p>\n<p><strong>This is the correct test, so we can trust our result.<\/strong><\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Since the non-parametric test doesn\u2019t need us to test assumptions, why not just use it all the time?<\/strong><\/p>\n<ul>\n<li>When sample sizes are large, the non-parametric and parametric tests often come to the same general conclusion (i.e. the differences are significant or they are not), so it seems like more work to have to do all this extra testing. However, when patterns are not as clear (i.e. when p-values are close to the 0.05 cutoff), or for small sample sizes, it can make a major difference. The Parametric tests have greater <strong>POWER <\/strong>to see a difference if it actually is present, so we always use these ones if we can. Therefore, we must test assumptions first.<\/li>\n<\/ul>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div>\n<h4><strong>A second two-sample example, t-test with data in a single column.<\/strong><\/h4>\n<p><strong>Study question: Do males and females have the same average resting pulse?<\/strong><\/p>\n<ul>\n<li><strong>\u00a0<\/strong>Choose the 2-sample t-test following the instructions on p. 97 (<strong>Stat,<\/strong><strong> Basic Stats, 2-sample t)<\/strong><\/li>\n<li><em>When selecting your variable, you\u2019ll be able to choose the \u201csamples in the same column option\u201d in the t- test menu, but you\u2019ll still have to separate them to check for normality. You can do this manually, by splitting the worksheet (see p. 46), or by \u201cunstacking\u201d the column.<\/em><\/li>\n<\/ul>\n<p><strong>\u201cUnstacking\u201d a column to test subgroups separately:<\/strong><\/p>\n<ul>\n<li>Choose <strong>Data<\/strong> from the upper property bar, then <strong>Unstack<\/strong> <strong>columns <\/strong>from the drop-down menu:<\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-18.png\" alt=\"\" width=\"445\" height=\"327\" class=\"alignnone size-full wp-image-405\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-18.png 445w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-18-300x220.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-18-65x48.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-18-225x165.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-18-350x257.png 350w\" sizes=\"auto, (max-width: 445px) 100vw, 445px\" \/><\/p>\n<p>\u201cSuperscripts\u201d refer to your grouping variable, so if you want to separate out the two sexes (male and female), then choose \u201csex\u201d as your subscript.<\/p>\n<ul>\n<li>Check the boxes for \u201cafter last column in use\u201d and \u201cname the columns\u201d so that the data will appear in your worksheet.<\/li>\n<\/ul>\n<p><strong>Test normality<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-20.png\" alt=\"\" width=\"1508\" height=\"504\" class=\"alignnone size-full wp-image-406\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-20.png 1508w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-20-300x100.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-20-1024x342.png 1024w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-20-768x257.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-20-65x22.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-20-225x75.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-20-350x117.png 350w\" sizes=\"auto, (max-width: 1508px) 100vw, 1508px\" \/><\/p>\n<\/div>\n<div>\n<p>Pulse 1 males: The probability plot has most values in the 95% confidence bands, and the p- Value for the Ryan Joiner statistical test is &gt;0.05 (too many tied values for AD)<\/p>\n<p><strong>Conclusion: Normal<\/strong><\/p>\n<p>Pulse 1 females: The probability plot has all values in the 95% confidence bands, and the p-Value for the Ryan Joiner statistical test is &gt;0.05<\/p>\n<p><strong>Conclusion: Normal<\/strong><\/p>\n<\/div>\n<div>\n<p>&nbsp;<\/p>\n<p><strong>Test equal variance<\/strong><\/p>\n<p><strong><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-21.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-407\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-21.png 576w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-21-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-21-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-21-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-21-350x233.png 350w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><\/strong><\/p>\n<p><strong>Conclusion: variance is equal since p&gt;0.05 for both tests<\/strong><\/p>\n<p>Important: do not stop here&#8230; these tests only assessed assumptions. Now you must do the test to test your study question.<\/p>\n<\/div>\n<p>&nbsp;<\/p>\n<div>\n<p>&nbsp;<\/p>\n<p><strong>Carry out the 2-sample t-test<\/strong><strong>, <\/strong>this time with data in one column.<\/p>\n<p>Since variances are equal, check box for \u201cAssume equal variances\u201d.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Two-sample T for pulse1<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-22.png\" alt=\"\" width=\"565\" height=\"668\" class=\"alignnone size-full wp-image-408\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-22.png 565w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-22-254x300.png 254w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-22-65x77.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-22-225x266.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-22-350x414.png 350w\" sizes=\"auto, (max-width: 565px) 100vw, 565px\" \/><\/p>\n<p><strong>Conclusion: <\/strong><strong>Assumptions are satisfied, so test is valid and groups are significantly different since p&lt;0.05.<\/strong><\/p>\n<p><strong>\u00a0<\/strong><strong>Trend statement: <\/strong><strong>Average pulse rate is significantly higher for females (group 2) than for males (group 1) (t-test, p=0.008)<\/strong><\/p>\n<ul>\n<li><strong>\u00a0Remember to always write a trend statement giving the clear trend in the data, with the name of the statistical test and the p-value in brackets.<\/strong><strong><br \/>\n<\/strong><\/li>\n<\/ul>\n<p><strong>\u00a0<\/strong><\/p>\n<table style=\"width: 809px;height: 154px\">\n<tbody>\n<tr>\n<td style=\"width: 794.033px\">\n<div>\n<p><strong>How should you report p-values?<\/strong><\/p>\n<ul>\n<li>When possible, report the actual p-value if you have it from the computer test<\/li>\n<li>Rule of thumb: <strong>only report to a max. of 3 decimal places<\/strong>:<\/li>\n<li>If the p is given as 0.000 or 0.0000 then write as p&lt;0.001\n<ul>\n<li>Even if the computer reports a value like 0.000, <strong>NEVER <\/strong>say p=0.000, since probability is always higher than zero<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Interesting side note:<\/strong><\/p>\n<p>These questions illustrate how many questions can come out of a single dataset, if the sample size is large enough, and the study design incorporated many variables. In the previous example, we tested whether two different groups of students were different from each other. In this example, we are testing to see whether an individual group of students can show significant differences from one time<\/p>\n<p>to another. For this to work, the data must be \u201cpaired\u201d&#8230; i.e. we must identify each individual and be sure we know his\/her before and after result.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Two sample testing when values are \u201cPaired\u201d <\/strong><strong><em>Study<\/em><\/strong><strong><em> question Did the pulse rates of the individual students go up after running?<\/em><\/strong><\/p>\n<p><strong>Note<\/strong> <strong>the<\/strong> <strong>importance<\/strong> <strong>of<\/strong> <strong>how the question is worded:<\/strong><\/p>\n<p>If we want to know if the average pulse rate goes up after running, we would do a regular t-test or Mann-Whitney U-test (if non-normal). But because we have data on each individual person, we can see if the individual rates go up by using a test that focuses on individual responses, called a \u201cpaired\u201d test. This is particularly useful if high variability in individuals makes it difficult to see a pattern in the average response, and the paired test will have more power to see the differences than the regular one.<\/p>\n<p><strong>How<\/strong> <strong>Paired tests work:\u00a0 <\/strong>In paired tests, we look at a measured value for known individuals at two (or more) times.<\/p>\n<p>The example below is for the pulses in the group who ran in place for one minute, so you can see that their pulse rates all went up, though not by the same amount.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-24.png\" alt=\"\" width=\"1408\" height=\"276\" class=\"alignnone size-full wp-image-409\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-24.png 1408w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-24-300x59.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-24-1024x201.png 1024w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-24-768x151.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-24-65x13.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-24-225x44.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-24-350x69.png 350w\" sizes=\"auto, (max-width: 1408px) 100vw, 1408px\" \/><\/p>\n<p>We can test this using a one-sample test, to see if the group of numbers in the difference column is significantly different from zero.<\/p>\n<p><strong>For parametric data, there is a t-test that calculates the differences and does the one sample test automatically in a single step. For non-parametric data, we have to do the extra step ourselves.<\/strong><\/p>\n<p><strong>Step 1: <\/strong>separate out the runners from the non- runners in both our resting pulse and our after running pulse columns (pulse 1 and pulse 2). <strong>This will give us four groups of data as you can see from the graph at right.<\/strong><\/p>\n<p><strong>Remember<\/strong> <strong>to<\/strong> <strong>plot<\/strong> <strong>your<\/strong> <strong>data<\/strong> <strong>so<\/strong> <strong>you<\/strong> <strong>can<\/strong> <strong>see<\/strong> <strong>what you are comparing!<\/strong><\/p>\n<ul>\n<li>We can do this manually (by copying and pasting into additional columns, \u201csplitting\u201d the worksheet into multiple smaller worksheets to work on using Minitab or \u201cunstacking\u201d the columns.<\/li>\n<\/ul>\n<\/div>\n<div>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-25.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-411\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-25.png 576w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-25-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-25-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-25-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-25-350x233.png 350w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><\/p>\n<p>Figure 1. Comparison of pulse rates for a group of students before and after one group ran for 1 Min.<\/p>\n<\/div>\n<div>\n<p><strong>Step 2: Compare pulse 1 and pulse 2 for the non-runners (control) and then the runners (test)<\/strong><\/p>\n<p><strong>First: Test assumptions:<\/strong><\/p>\n<p><strong>1. <\/strong><strong>Normality testing<\/strong>: test normality of all four groups, using Ryan Joiner (since many tied values)<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-26.png\" alt=\"\" width=\"1319\" height=\"871\" class=\"alignnone size-full wp-image-412\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-26.png 1319w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-26-300x198.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-26-1024x676.png 1024w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-26-768x507.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-26-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-26-225x149.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-26-350x231.png 350w\" sizes=\"auto, (max-width: 1319px) 100vw, 1319px\" \/><\/p>\n<p><strong>2.\u00a0\u00a0\u00a0 <\/strong><strong>Equal variance<\/strong>: Running group p=0.004\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Non-running group: p=0.624<\/p>\n<p>(Variances between before &amp; after running: non-equal for runners, equal for non-runners)<\/p>\n<p><strong>Conclusion from assumptions:<\/strong><\/p>\n<ul>\n<li>Runners: one group was normal the other non normal; variances were not equal<\/li>\n<li>Non-runners: data were normal, and variances were equal<\/li>\n<\/ul>\n<p>Therefore, both the parametric and non-parametric paired tests are needed to assess whether there are differences between the individuals pulse rates at the two different times.<\/p>\n<\/div>\n<div>\n<h4><strong>Parametric data &#8211; use the Paired t-test<\/strong><\/h4>\n<p>Run this test using the Non-runners\u2019 pulse rate data, since these data were normal and had equal variance.<\/p>\n<p><strong>Make sure the data to be tested are in separate columns, e.g. non-runners at time 1 and non-runners at time 2.<\/strong><\/p>\n<ul>\n<li>Select <strong>Stat <\/strong>from the property bar, and <strong>Basic Statistics.<\/strong><\/li>\n<li>Choose the <strong>paired t test <\/strong>from the drop down menu.<\/li>\n<li>Choose variables so that your data for the non-runners from time 1 is being compared to non-runners in time 2.<\/li>\n<\/ul>\n<p><strong>Minitab Output:<\/strong><\/p>\n<p>&nbsp;<\/p>\n<p><strong>Paired T-Test and CI: pulse1nonran, pulse2nonran<\/strong><strong>\u00a0<\/strong><\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Since p&gt;0.05, we accept the null hypothesis, and say there is no difference in the groups.<\/strong><\/p>\n<p>Notice the difference here in what is being tested: instead of comparing one mean to another, it tests whether the mean difference is equal to zero<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div>\n<h4><strong>Non-parametric paired two sample test:<\/strong><\/h4>\n<ul>\n<li>If data are non-normal, then you must use a non-parametric test. There is no one-step non-parametric paired test, but you can carry it out in two steps.<\/li>\n<\/ul>\n<p><strong>Step one: <\/strong>Subtract your time 1 column from your time 2 column (as in the table on p. 103). You can do this in Excel, and copy and paste the values into Minitab.<\/p>\n<p><strong>Step two: <\/strong>Carry out a one-sample Wilcoxin test to test whether your median is different from zero (as shown on p. 96).<\/p>\n<p>Choose your column with the runner differences, and check the box for testing that the median is zero:<\/p>\n<p>Minitab Output:<\/p>\n<p><strong> <img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-28.png\" alt=\"\" width=\"246\" height=\"168\" class=\"alignnone size-full wp-image-416\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-28.png 246w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-28-65x44.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-28-225x154.png 225w\" sizes=\"auto, (max-width: 246px) 100vw, 246px\" \/><br \/>\n<\/strong><\/p>\n<p><strong>Wilcoxon Signed Rank Test: run difs<\/strong><\/p>\n<p><strong>Since p&lt;0.001, then we reject the Null hypothesis; i.e. the pulse rates are significantly different between time 1 and time 2.<\/strong><\/p>\n<\/div>\n<p>&nbsp;<\/p>\n<div>\n<h4><strong>Comparing &gt;2 groups of data<\/strong><\/h4>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Balanced<\/strong> <strong>Designs<\/strong><\/p>\n<p>Some packages (e.g. Excel) assume a \u201cbalanced design\u201d; that means that there have to be the same number of observations in each group being studied. In Minitab, a balanced design is not necessary for simple ANOVA (although it is in 2-way ANOVA), but note that if your groups have too few observations, you have very little \u201cpower\u201d to pick out differences.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Analysis of Variance (ANOVA) or the non- parametric equivalent (Kruskall-Wallis) allows us to compare more than 2 groups of data. As with the t-test, we are comparing means, and the null hypothesis is that the means are equal.<\/p>\n<ul>\n<li><strong>If p&lt;0.05, the means are statistically different (assuming the assumptions of ANOVA are met).<\/strong><\/li>\n<\/ul>\n<p><strong>Setting up your data:<\/strong><\/p>\n<p>Your dataset can be set up in the worksheet so that the responses being measured are all in one column, and the groups that you are comparing are given as categories in a separate column (See our pulses dataset on p. 83 as an example). Alternatively, the data can be set up so your groups are in adjoining columns. For the ANOVA, these have special names: If data are in one column, Minitab refers to this as \u201c<strong>stacked<\/strong>\u201d. If data are in adjoining columns, Minitab refers to this set-up as \u201c<strong>unstacked<\/strong>\u201d.<\/p>\n<h5>Parametric Data: Analysis of Variance (ANOVA)<\/h5>\n<p><strong>One-Way Design <\/strong>(comparing groups without subgroups)<\/p>\n<p>Example of comparing three or more groups of data; \u201cstacked\u201d data. <strong>Study Question: <em>Are resting pulse rates different depending on normal activity level?<\/em><\/strong><\/p>\n<ul>\n<li>This will give three groups to compare, with three activity levels.<\/li>\n<\/ul>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p>Important design note: The design of this study is highly unbalanced, and the group reporting low activity level was very small in number; it may be too small to give random and independent data. Therefore, ANOVA result should be treated with caution.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Assumptions:<\/p>\n<p>ANOVA assumes that data are parametric, which means:<\/p>\n<ul>\n<li>Normally distributed<\/li>\n<li>Variances are equal<\/li>\n<li>Design should be set up so data are independent and random.<\/li>\n<\/ul>\n<p><strong>1. <\/strong><strong>Test for normality <\/strong>for each group of data as explained in the earlier section<\/p>\n<ul>\n<li>When this was done, all groups were normal (p&gt;0.05)<\/li>\n<\/ul>\n<p><strong>2. <\/strong> <strong>Equal Variance Test<\/strong>: <strong>Do not use the equal variance test you used for t-tests!<\/strong><\/p>\n<ul>\n<li>To test equal variance on three or more groups of data, we need to use the equal variance test found in the ANOVA menu. The one in the t-test menu only tests variance for two groups of data, not for three or more.<\/li>\n<\/ul>\n<p>Select <strong>Stat <\/strong>from the property bar, then <strong>ANOVA<\/strong>. Look part way down the drop-down menu, and select <strong>Test for Equal Variances<\/strong><\/p>\n<p>Choose your response variable (in this case, pulses 1 to assess the resting pulse rates) and your factor variable (in this case, activity level), and click okay.<\/p>\n<p><strong>Equal variances result:<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-30.png\" alt=\"\" width=\"1230\" height=\"866\" class=\"alignnone size-full wp-image-420\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-30.png 1230w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-30-300x211.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-30-1024x721.png 1024w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-30-768x541.png 768w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-30-65x46.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-30-225x158.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-30-350x246.png 350w\" sizes=\"auto, (max-width: 1230px) 100vw, 1230px\" \/><\/p>\n<\/div>\n<div>\n<p>p&gt;0.05 so variances are equal<\/p>\n<p>Notice in the graph how the variances are all very similar with a lot of overlap&#8230; that shows that the variances are very similar.<\/p>\n<p><strong>Conclusion: <\/strong>Although we need to be cautious about interpreting patterns about the low activity level due to the small and unbalanced sample size, our data meet the assumptions of the ANOVA, so we can run the test.<\/p>\n<p><strong>Method for One-Way ANOVA:<\/strong><\/p>\n<ul>\n<li>Choose \u201c<strong>stat<\/strong>\u201d from the property bar, then choose \u201c<strong>ANOV<\/strong>A\u201d, then choose \u201c<strong>one-way<\/strong>\u201d. One way analysis of variance means that you are simply comparing 3 or more groups of data, without any subgroups in them.<\/li>\n<\/ul>\n<p>This puts you into the menu for the ANOVA with data set up in one column<\/p>\n<p><strong>The<\/strong><strong> Null hypothesis <\/strong>is that<\/p>\n<p>X1 = X2 = X3<\/p>\n<p>therefore, a significant p-value (p&lt;0.05) means that at least one group is different from the others.<\/p>\n<\/div>\n<div>\n<ul>\n<li><strong>Select your variables<\/strong>\n<ul>\n<li>The \u201c<strong>response<\/strong>\u201d variable is the one where your measurements are, for example one of the columns of pulse rates.<\/li>\n<li>The \u201c<strong>factor<\/strong>\u201d is the column where the different groupings or levels are shown.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/div>\n<div>\n<p>In our pulses dataset, we have 3 levels of activity, so we could choose activity level as our factor.<\/p>\n<p><strong>Click<\/strong> <strong>on<\/strong> <strong>\u201cokay\u201d,<\/strong> <strong>and<\/strong> <strong>the<\/strong> <strong>output<\/strong> <strong>will look like:<\/strong><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><strong><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-31.png\" alt=\"\" width=\"532\" height=\"669\" class=\"alignnone size-full wp-image-421\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-31.png 532w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-31-239x300.png 239w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-31-65x82.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-31-225x283.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-31-350x440.png 350w\" sizes=\"auto, (max-width: 532px) 100vw, 532px\" \/><\/strong><\/p>\n<p><strong>Interpretation: <\/strong>p&gt;0.05, so the means <strong>are not significantly different <\/strong>between the groups. We conclude that activity level <strong>does not <\/strong>have an effect on resting pulse in these students.<\/p>\n<\/div>\n<p><strong>Remember to always plot your data <\/strong>to see the patterns. This helps you to determine what you actually want to test, and can help you interpret your data patterns. You can see from the graph at right that although it looks like the people with low activity level had a higher resting pulse rate, the variability in the data means that there is no significant difference<\/p>\n<div><\/div>\n<div>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-32.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-422\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-32.png 576w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-32-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-32-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-32-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-32-350x233.png 350w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><\/p>\n<p><strong>Conclusion: we have done the appropriate<\/strong> <strong>test, so we can trust our result.<\/strong><\/p>\n<p><strong>Trend Statement: <\/strong><strong>There is no significant difference among resting pulses in the three groups (ANOVA, p = 0.155).<\/strong><\/p>\n<h5><strong>Non-parametric data: Kruskall-Wallis Test<\/strong><\/h5>\n<\/div>\n<div>\n<p>If data were <strong>not <\/strong>normal [and could not be made normal through transformation], we would use the Kruskal-Wallis test, which compares medians.<\/p>\n<p><strong>Data Setup:<\/strong><\/p>\n<ul>\n<li>Data must be set up so that the response variable (in this case, Pulse rate) is in one column, and the grouping variable is in another column.<\/li>\n<li>Select \u201c<strong>non-parametrics<\/strong>\u201d from the \u201c<strong>stats<\/strong>\u201d menu, and choose the Kruskal-Wallis.<\/li>\n<li>Select your variables:<\/li>\n<\/ul>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Transforming<\/strong>:<\/p>\n<ul>\n<li>If your data are not normal and don\u2019t have equal variance, you can use the non-parametric test, or you can try transforming the data. Common transformations are the <strong>log transformation, square root transformation, and inverse tranformation <\/strong>\u2013 To transform, convert all values in each column of data being analysed to the transformation, and try your analyses again.<\/li>\n<li>Remember to always report your actual data (in the text or in graphs) and not the transformed data<\/li>\n<\/ul>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-33.png\" alt=\"\" width=\"500\" height=\"608\" class=\"alignnone size-full wp-image-423\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-33.png 500w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-33-247x300.png 247w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-33-65x79.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-33-225x274.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-33-350x426.png 350w\" sizes=\"auto, (max-width: 500px) 100vw, 500px\" \/><\/p>\n<p><strong>Trend statement: <\/strong><strong>There is no significant difference among the pulse rates for the students who have the different activity levels (Kruskal-Wallis test, p=0.153)<\/strong><\/p>\n<\/div>\n<div>\n<h4><strong>Example of comparing three or more groups of data; \u201cun-stacked\u201d data<\/strong><\/h4>\n<p><strong>Study Question: <\/strong><em>Is there a difference among the runners and non-runners, before and after running?<\/em><\/p>\n<p>Running Group<\/p>\n<p><strong>Data Setup: <\/strong>For this method, put the data into 4 separate columns, and run a <strong>One-Way ANOVA<\/strong>, <strong>unstacked<\/strong>.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-34.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-425\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-34.png 576w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-34-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-34-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-34-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-34-350x233.png 350w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><\/p>\n<p>A quick plot of the data shows no dif. in pulse between time 1 and 2 for the non-runners, but there seems to be a big difference for the runners. What we would like to know is whether that difference is significant.<\/p>\n<p><strong>The null hypothesis is that there is no difference among groups.<\/strong><\/p>\n<p><strong>\u00a0<\/strong><strong>Assumptions<\/strong>:<\/p>\n<ul>\n<li><strong>Normality: <\/strong>some groups were normal, and others were not. The non-normal groups were relatively close to normal.<\/li>\n<li><strong>Variances: <\/strong>were not equal<\/li>\n<\/ul>\n<p><strong>Conclusion from Assumptions:<\/strong><\/p>\n<p>Data are non-parametric <strong>(if<\/strong> <strong>even<\/strong> <strong>one<\/strong> <strong>group<\/strong> <strong>you<\/strong> <strong>are<\/strong> <strong>comparing<\/strong> <strong>don\u2019t<\/strong> <strong>fit<\/strong> <strong>assumptions,<\/strong> <strong>then<\/strong> <strong>your<\/strong> <strong>data<\/strong> <strong>are<\/strong> <strong>nonparametric)<\/strong>, so should either be transformed or a non-parametric test should be chosen. However, ANOVA is \u201crobust\u201d to minor violations of assumptions, so since data are close to normal, it may be okay to do the parametric test. To be sure, do both the parametric and non-parametric test and compare.<\/p>\n<p><strong>ONE-WAY ANOVA, \u201cunstacked\u201d<\/strong><\/p>\n<ul>\n<li>Select <strong>Stat <\/strong>from the property bar, then choose <strong>ANOVA <\/strong>from the drop down menu.<\/li>\n<li>Choose <strong>One-Way (Unstacked)<\/strong><\/li>\n<li>Select variables (these must be in adjoining columns in your dataset)<\/li>\n<\/ul>\n<\/div>\n<div>\n<p><strong>Output (ANOVA table)<\/strong><\/p>\n<p><strong> <img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-35.png\" alt=\"\" width=\"675\" height=\"668\" class=\"alignnone size-full wp-image-426\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-35.png 675w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-35-300x297.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-35-65x64.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-35-225x223.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-35-350x346.png 350w\" sizes=\"auto, (max-width: 675px) 100vw, 675px\" \/><\/strong><\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Interpretation: <\/strong>p&lt;0.05, so there is a significant difference in the groups. However, we don\u2019t know which groups are different.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Important<\/strong><strong>\u00a0<\/strong><strong>note:<\/strong><\/p>\n<p>p&lt;0.05, so groups are significantly different. However, there is no way to know from the simple One-Way ANOVA which group is different from which other(s). We need to do further testing to figure this out.<\/p>\n<p>Multiple Comparison Tests:<\/p>\n<p><strong>\u00a0<\/strong><strong>ANOVA is designed to tell us if there are any differences, but not which groups are different. For this: do Multiple Comparison Tests<\/strong><strong>\u00a0<\/strong><\/p>\n<table style=\"height: 259px\">\n<tbody>\n<tr style=\"height: 259px\">\n<td style=\"height: 259px;width: 1364.03px\">\n<div>\n<p><strong>Type<\/strong><strong> I Error Inflation:<\/strong><\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p>Recall that if we do multiple comparisons on the same data set (e.g. <strong>group 1 vs group 2<\/strong>, <strong>group 1 vs group 3 <\/strong>and <strong>group 2 vs group 3<\/strong>) then our probability of getting an incorrect interpretation goes up. If you have three groups, that probability goes up from 0.05 to 0.14 for all the comparisons taken together.<\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p>This means that if we want to compare the individual groups, we need to use a special test that calculates a \u201cfamily error rate\u201d (the error rate for all the groups considered together) rather than individual error rates. These are called \u201c<em>post-hoc\u201d<\/em> or \u201cmultiple comparison\u201d tests.<\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p>If you find a significant result with your ANOVA (i.e. if you run ANOVA and your p&lt;0.05), then you should run a multiple comparison test to see which of the groups are significantly different from each other.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div>\n<p>The most commonly used multiple comparison test is Tukey\u2019s Test. To do a Multiple Comparison test, click on \u201ccomparisons\u201d in the initial ANOVA menu<\/p>\n<ul>\n<li>Check the box for Tukey\u2019s test, and click OK<\/li>\n<li>Then run the ANOVA as before. (You will see the ANOVA output, and some additional information that lets you compare each group)<\/li>\n<\/ul>\n<p>Output:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-36.png\" alt=\"\" width=\"638\" height=\"641\" class=\"alignnone size-full wp-image-427\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-36.png 638w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-36-300x300.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-36-150x150.png 150w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-36-65x65.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-36-225x226.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-36-350x352.png 350w\" sizes=\"auto, (max-width: 638px) 100vw, 638px\" \/><\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Interpretation: <\/strong>Look for groupings that have a different letter under the \u201cGrouping\u201d heading. In the example above, the pulse 1 and pulse 2 groups for the non-runners, and the pulse 1 group for the runners all have the same letter, so they are not significantly different. However, the pulse 2 data for the runners has a different letter, so that means it is significantly different than all the other groups. To find out how it is different, look at the column with the means: Pulse2ran clearly has a higher mean value than the other ones.<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Therefore, to report this trend, you\u2019d write something like:<\/strong><br \/>\nThere was a significant difference in pulse rates in the groups of students (ANOVA, p &lt;0.001). There was no difference in resting pulses of the two groups of students (runners vs non runners) prior to running, but there was a significant increase in pulse rate in the running group after running (Tukeys test, p&lt;0.05).<\/p>\n<h5>Comparing &gt;2 groups of data with subgroups<\/h5>\n<\/div>\n<div>\n<p><strong>Factorial Analysis of Variance\u00a0 <\/strong><strong>(This can be a 2-way ANOVA, 3-Way, etc.)<\/strong><\/p>\n<p><strong>\u00a0<\/strong>If your data contain distinct subgroups, you can run an ANOVA that lets you test the effects of those subgroups, or factors, on your response at the same time as testing your main grouping factor. This is called a factorial analysis.<\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Important note: Minitab requires a \u201cbalanced design\u201d for 2-way ANOVA<\/strong><\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Consider an example where a researcher would like to know whether a particular feed supplement would increase growth in chickens, and whether the sex of the chicken would affect how it worked. This is a standard \u201ctwo-way\u201d design, where we are looking at two factors at the same time.<\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Data<\/strong> <strong>Set-up<\/strong><\/p>\n<p>Set up the data in Minitab so that the response data (weight) are in one column, and the grouping factors are in other columns:<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p>This is known as a factorial table. If your data can be set up in this way to show groupings, then it is a good candidate for a two-way ANOVA.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Table 1. Weight (grams) after two weeks in a group of bantam chicks on two different diets; the standard diet, and one supplemented by blueberry extract.<\/p>\n<table>\n<tbody>\n<tr>\n<td>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>males<\/td>\n<td>supplement<\/p>\n<p>&nbsp;<\/p>\n<p>590<\/td>\n<td>control<\/p>\n<p>&nbsp;<\/p>\n<p>440<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td>530<\/td>\n<td>570<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td>550<\/td>\n<td>509<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td>570<\/td>\n<td>510<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td>650<\/td>\n<td>589<\/td>\n<\/tr>\n<tr>\n<td>females<\/td>\n<td>530<\/td>\n<td>550<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td>580<\/td>\n<td>420<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td>520<\/td>\n<td>440<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td>520<\/td>\n<td>520<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td>560<\/td>\n<td>370<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div>\n<p><strong>Assumption<\/strong><strong> Testing:<\/strong><\/p>\n<ul>\n<li><strong>Normality (A-D): <\/strong>\n<ul>\n<li>control, female, p=0.64<\/li>\n<li>control, male, p=0.52<\/li>\n<li>supplement, female, p=0.21<\/li>\n<li>supplement, male, p=0.59<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><strong>All data are normally distributed<\/strong><\/p>\n<p><strong>Equal Variance:<\/strong><\/p>\n<ul>\n<li>Use the \u201cTest for equal variances\u201d in the ANOVA menu.<\/li>\n<li>Set it up as shown below:<\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-37.png\" alt=\"\" width=\"731\" height=\"492\" class=\"alignnone size-full wp-image-428\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-37.png 731w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-37-300x202.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-37-65x44.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-37-225x151.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-37-350x236.png 350w\" sizes=\"auto, (max-width: 731px) 100vw, 731px\" \/><\/p>\n<p><strong>Equal Variance Output:<\/strong><\/p>\n<p><strong>p&gt;0.05, therefore accept the null <\/strong><strong>that variances are equal.<\/strong><\/p>\n<\/div>\n<div>\n<p><strong>Now<\/strong> <strong>run the test:<\/strong><\/p>\n<ul>\n<li><strong>\u00a0<\/strong>Select <strong>Stat <\/strong>from the property bar, then choose <strong>ANOVA,<\/strong> then <strong>Two-Way<\/strong><\/li>\n<li><strong>\u00a0<\/strong>Select your variables:\n<ul>\n<li><strong>Response = <\/strong>the data column, so, <strong>weight Row and column factors:<\/strong><\/li>\n<\/ul>\n<\/li>\n<li>Before doing a 2-way ANOVA, you should set up a table to assess your factors<\/li>\n<li>Use that to decide which is a row factor and which is a column factor.<\/li>\n<\/ul>\n<\/div>\n<div>\n<p>&nbsp;<\/p>\n<p><strong>First, plot the data to see the patterns you want to test:<\/strong><\/p>\n<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-38.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-430\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-38.png 576w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-38-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-38-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-38-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-38-350x233.png 350w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><strong><br \/>\n<\/strong><\/p>\n<div>\n<p><strong>Two Way ANOVA Output:<\/strong><\/p>\n<p><strong> <img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-39.png\" alt=\"\" width=\"485\" height=\"410\" class=\"alignnone size-full wp-image-431\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-39.png 485w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-39-300x254.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-39-65x55.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-39-225x190.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-39-350x296.png 350w\" sizes=\"auto, (max-width: 485px) 100vw, 485px\" \/><\/strong><\/p>\n<ul>\n<li><\/li>\n<\/ul>\n<p><strong>Interpretation:<\/strong><\/p>\n<p><strong>Step 1: look at p-values for individual factors<\/strong><\/p>\n<p>Sex: p=0.051<\/p>\n<ul>\n<li>Since p&gt;0.05, we accept the null and conclude there is no difference Food: p = 0.011\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 since p&lt;0.05, we reject the null and conclude there is a difference<\/li>\n<\/ul>\n<p><strong>Step 2: look at the p-value for interaction<\/strong><\/p>\n<p>Interaction: p=0.577,<\/p>\n<ul>\n<li>Since p&gt;0.05, there is no interaction with respect to the weight response among the factors (i.e. weight acted the same way for both sexes (was lower) regardless of the food type).<\/li>\n<\/ul>\n<p><strong>Interaction<\/strong> refers to whether the response acts the same way for the different levels of the factors.<\/p>\n<ul>\n<li>e.g. If the growth was higher on blueberries for males, and lower for blueberries for females, that would be an example of a different response for the two sexes&#8230; This would give a significant interaction.<\/li>\n<li>Since growth was lower for females than males for both food types, it means there was no interaction.<\/li>\n<\/ul>\n<p><strong>To see the actual trend, we look at the graph:<\/strong><\/p>\n<p>Chickens that were fed a diet supplemented by blueberry extract grew significantly larger than those on a standard diet (Two-Way ANOVA, p=0.012), and there was no significant difference in response between male and female chicks (Two-Way ANOVA, p=0.057). There was no significant interaction between food type and sex of chicks (Two-Way ANOVA, p=0.58), indicating that the food supplement affected weight in the same way for male and female chicks.<\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p>Remember to always say which is higher\/lower than the other, if you can<\/p>\n<p>To report the interaction, be sure you explain it in English &#8211; do not just say there is a significant interaction or not.<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div>\n<h5>Relationships between variables: Regression and Correlation<\/h5>\n<p>Correlation analysis tests whether there is a correlation between two continuous variables Regression analysis takes it a step further to give us the equation of the line and give information on how much of the variation in the points can be statistically related to the other variable.<\/p>\n<p><strong>Step 1: Plot the data to see if there is a relationship between the variables <\/strong><strong>e.g. Is Weight related to Height of the students in the pulses study? If so, what is the equation of the line.<\/strong><\/p>\n<p><strong>\u00a0<\/strong>The graph suggests that the weight of the students increases as their heights increase.<\/p>\n<\/div>\n<div><\/div>\n<div>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-40.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-432\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-40.png 576w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-40-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-40-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-40-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-40-350x233.png 350w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><\/p>\n<\/div>\n<div>\n<p>We can run <strong>correlation <\/strong>analysis to see how strongly related (correlated) the variables are, and we can run <strong>regression <\/strong>analysis to get the equation of the \u201cbest\u201d line that describes the relationship.<\/p>\n<p>We can use <strong>regression <\/strong>to describe relationships, or to generate an equation (usually called a \u201cmodel\u201d that can be used to predict for values<\/p>\n<p><strong>Assumptions: Correlation<br \/>\n<\/strong><\/p>\n<ul>\n<li>Each variable must be normally distributed<\/li>\n<li>The relationship must be linear<\/li>\n<li>The residuals (errors) must be normally distributed<\/li>\n<\/ul>\n<p><strong>Assumptions: Regression<\/strong><\/p>\n<ul>\n<li>Each variable must be normally distributed<\/li>\n<li>The relationship must be linear (for linear regression)<\/li>\n<li>The residuals must be evenly distributed along the line<\/li>\n<\/ul>\n<p><strong>Some<\/strong> <strong>important<\/strong> <strong>definitions:<\/strong><\/p>\n<ol>\n<li>Independent variable: the variable that is fixed for our analysis&#8230; what we are comparing the other variable to plot this variable on the x-axis<\/li>\n<li>Dependent variable: the variable that \u201cdepends on\u201d the x-axis variable; the one we expect to change or vary with our fixed variable. Plot this variable on the y-axis<\/li>\n<li>Residual: the amount of vertical (y-axis) variation of the observed points from the regression line (i.e. the \u2018y\u2019 value of your point minus the \u2018y\u2019 value on your line)<\/li>\n<li>Standardized residual: (residual) \u00f7 (standard deviation of residual). Standardized residuals have been standardized to have a variance of 1. Standardized residuals over 2 are usually considered large, and points with that large a residual are often considered to be outliers outside of our observed range.<\/li>\n<\/ol>\n<\/div>\n<div>\n<p><strong>Method:<\/strong><\/p>\n<ul>\n<li>Test normality as described before<\/li>\n<li>Test linearity by looking at the graph and deciding if the points form a straight line<\/li>\n<li>Test the normality and linearity of the residuals through specialized menus in the Regression menu.<\/li>\n<\/ul>\n<\/div>\n<div>\n<h5><strong>Correlation<\/strong><\/h5>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-41.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-433\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-41.png 576w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-41-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-41-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-41-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-41-350x233.png 350w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><\/p>\n<\/div>\n<div>\n<p><strong>The graph above shows a relationship between height and weight of the student participants.<\/strong><\/p>\n<ul>\n<li>How related are they?<\/li>\n<li>Is the relationship significant?<\/li>\n<\/ul>\n<p>To test these questions, we run a correlation analysis.<\/p>\n<p><strong>The correlation analysis gives us two statistics, <\/strong>the correlation coefficient, and the p-value<\/p>\n<\/div>\n<div>\n<p><strong>Correlation Coeffient <\/strong>( r ) tells you how related the two variables are on a scale of zero to one<\/p>\n<ul>\n<li>A negative \u2018r\u2019 means the relationship is negative (slopes downward)<\/li>\n<li>A positive \u2018r\u2019 means the relationship is positive (slopes upwards)<\/li>\n<li>A value of 1 or -1 means 100% related<\/li>\n<li>General rule of thumb is that a \u201cgood\u201d correlation is 0.7 to 1.0 or -0.7 to -1.0<\/li>\n<\/ul>\n<p><strong>P-value: <\/strong>Tells you whether the slope of your line is significantly different from zero (i.e do you have a significant relationship. If p&lt;0.05, the variables are significantly correlated<\/p>\n<p><strong>Carry out the test:<\/strong><\/p>\n<ul>\n<li>Select <strong>Stat <\/strong>from the property bar, then choose <strong>correlation <\/strong>from the <strong>Basic statistics <\/strong>drop down menu.<\/li>\n<\/ul>\n<p>[<strong>Note: <\/strong>do correlation analysis only if you don\u2019t need the equation of the line, or the coefficient of determination (r2)]<\/p>\n<p>&nbsp;<\/p>\n<p>With correlation, it doesn\u2019t matter what order you put the variables in, since we\u2019re just looking at a simple relationship.<\/p>\n<p><strong>Output from Minitab:<\/strong><strong>\u00a0<\/strong><\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>Correlations: height, weight<\/strong><\/p>\n<p>Pearson correlation of height and weight = 0.786 P-Value = 0.000<\/p>\n<p>remember to write as p&lt;0.001<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Interpretation:<\/strong><\/p>\n<ul>\n<li>The r-value = 0.786, which tells us that we have a positive correlation, and it is a \u201cgood\u201d correlation (since the value is above 0.7) tells us how strong it is.<\/li>\n<li>The p-value tells us if it is a significant correlation, while the r-value<\/li>\n<\/ul>\n<\/div>\n<div>\n<p><strong>Assumptions were satisfied:<\/strong><\/p>\n<p>Both sets of data are normal (based on prob. plots) and the relationship is linear (based on the graph)<\/p>\n<\/div>\n<div>\n<h5>Regression:<\/h5>\n<p>Regression analysis lets us determine the \u201cmodel\u201d (line) that best describes the data.<\/p>\n<p>The regression analysis gives us three statistics:<\/p>\n<ul>\n<li>\u00a0r2 =\u201ccoefficient of determination\u201d<\/li>\n<li>The p-value, and<\/li>\n<li>The equation of the line.<\/li>\n<\/ul>\n<p>The p-value is the same as for correlation<\/p>\n<\/div>\n<div>\n<p>The <strong>Coefficient of determination, r<\/strong><strong>2<\/strong><strong>, <\/strong>tells us the proportion of the variation in the \u2018y\u2019 vaues that can be directly related(&#8220;statistically\u201d) to the x-value. It is often referred to as the amount of variation \u201cexplained\u201d by the variation in \u2018x\u2019, but note that that is meant in the statistical sense.<\/p>\n<\/div>\n<div>\n<p>In regression, we assume that one variable is fixed or independent, and that the other variable depends on the first. The independent variable goes on the x-axis and the dependent one goes on the y-axis.<\/p>\n<p><strong>The assumptions of linear regression are:<\/strong><\/p>\n<ol>\n<li>Linearity: data on the graph form a reasonable line, so we say they are linear<\/li>\n<li>Data normality: Normality plots of the two variable showed that both were normally distributed<\/li>\n<li>Error normality (normality of residuals) &#8211; test this after running the regression<\/li>\n<li>Equal distribution of residuals along the line &#8211; test after running the regression<\/li>\n<\/ol>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>A note on regression assumptions:<\/strong><\/p>\n<p>Unlike the assumptions for the group comparisons (e.g. ANOVA or t-test), many of the Regression are there to guide your interpretation, not to be an absolute guide to whether you can do the test or not.<\/p>\n<p><strong>\u201cMust<\/strong> <strong>have\u201d<\/strong> asummptions:<\/p>\n<ul>\n<li>Data must be linear for Linear Regression<\/li>\n<li>Data must be normal or near normal<\/li>\n<\/ul>\n<p><strong>Assumptions<\/strong> <strong>that<\/strong> <strong>guide<\/strong> <strong>interpretation:<\/strong><\/p>\n<ul>\n<li>Spread of residuals:\n<ul>\n<li>Residuals are the distance that each dependent variable point is away from the line (on the y-axis).\n<ul>\n<li>For the regression equation to have good predictive ability (i.e. so you can predict values of y if you know values of x), the residuals have to be normally distributed and distributed equally above and below the line for the entire relationship. For example, if there is low variability (scatter) at the lower part of the graph, and high variability (scatter) at the high part of the graph, it means that the line doesn\u2019t predict as well at the upper parts of the graph.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>&#8211; i.e. Even if these assumptions are violoated slightly, you can still do the regression, but have to use caution if predicting \u2018y\u2019 from \u2018x\u2019<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p><strong>Run<\/strong> <strong>the Regression analysis:<\/strong><\/p>\n<ul>\n<li>Select <strong>Stat<\/strong>from the property bar, then choose <strong>Regression <\/strong>and <strong>Regression <\/strong>and <strong>fit regression model<\/strong><\/li>\n<li>In the regression menu, put the dependent variable in the response box, and the independent variable in the predictor box<\/li>\n<\/ul>\n<p><strong>To test the assumptions while doing the test, <\/strong><strong>click<\/strong> <strong>on<\/strong> <strong>the \u201cGraphs\u201d box.<\/strong><\/p>\n<div>\n<ul>\n<li>In the graphs window, check the box beside <strong>Regular<\/strong>\n<ul>\n<li>This gives the actual values for the residuals<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>[\u2018Standardized\u2019 will convert the values based on their standard deviations so you can see them as a function of their variability. This can be useful in identifying outliers, but otherwise, the graph will look very similar]<\/p>\n<ul>\n<li>For \u201c<strong>residual<\/strong> <strong>plots<\/strong>\u201d, check the boxes beside normal probability plot, and the residuals vs the \u201cfits\u201d (the independent var).\n<ul>\n<li>If you think that there may be a bias in the order the data are collected, then also plot the residuals vs the order (e.g. if you collect data over a period of time, so you might get a different response later in the day than earlier, that could be a bias)<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/div>\n<div>\n<h6>Interpretation of Regression Assumptions:<\/h6>\n<p><strong>Distribution of residuals along the line:<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-42.png\" alt=\"\" width=\"717\" height=\"485\" class=\"alignnone size-full wp-image-434\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-42.png 717w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-42-300x203.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-42-65x44.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-42-225x152.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-42-350x237.png 350w\" sizes=\"auto, (max-width: 717px) 100vw, 717px\" \/><\/p>\n<ul>\n<li>Here, you are looking for evidence that points are scattered fairly evenly along the horizontal line. This means your \u201cvariance\u201d (variability in the observed values compared to the predicted line) is spread equally along the line&#8230; analogous to the \u201cequal variance\u201d assumption in ANOVA.<\/li>\n<li>The distribution of points above and below the line isn\u2019t perfect, but is fairly uniform if you don\u2019t count the outlier point in the upper right corner.<\/li>\n<li>This distribution is close enough to pass the assumption test.<\/li>\n<\/ul>\n<p><strong>Normality of residuals:<\/strong><\/p>\n<p><strong> <img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-43.png\" alt=\"\" width=\"725\" height=\"486\" class=\"alignnone size-full wp-image-435\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-43.png 725w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-43-300x201.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-43-65x44.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-43-225x151.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-43-350x235.png 350w\" sizes=\"auto, (max-width: 725px) 100vw, 725px\" \/><\/strong><\/p>\n<p>This graph is just like the other normality plots we look at to see if data are normal. The computer will automatically determine the residuals for you, and plot them on the normality plot.<\/p>\n<p><strong>Interpretation:<\/strong><\/p>\n<ul>\n<li>The values are mostly on the line, again with one outlier, so conclude that they are normal<\/li>\n<li>[Just to be sure, I did a normality test on the data, and it confirm that they are normal (p&gt;0.05).]<\/li>\n<\/ul>\n<p><strong>The output from the Regression analysis:<\/strong><\/p>\n<p><strong> <img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-45.png\" alt=\"\" width=\"518\" height=\"676\" class=\"alignnone size-full wp-image-436\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-45.png 518w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-45-230x300.png 230w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-45-65x85.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-45-225x294.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-45-350x457.png 350w\" sizes=\"auto, (max-width: 518px) 100vw, 518px\" \/><\/strong><\/p>\n<p>Regression Analysis Interpretation:<\/p>\n<\/div>\n<div>\n<ol>\n<li>Equation of the line. This is in the form of a straight line, y = mx + b<\/li>\n<li>The r2 value tells us the proportion of the variability in weight that can be \u201cexplained\u201d (or is statistically related to) by height, so 62% of the weight variation is mathematically related to height. 38% is related to other, unmeasured factors.<\/li>\n<li>The p-value tests whether the slope of the line is significantly different from zero (if there is too much variability in the points, you can\u2019t be sure that you have a relationship).<\/li>\n<\/ol>\n<p><strong>The assumptions were satisfied, so we can generally trust our result:<\/strong><\/p>\n<ul>\n<li>There is a significant positive relationship between height and weight.<\/li>\n<li>The assumption testing tells us we have one outlier value (we could investigate that more closely if we wanted), and that there is a bit more variation at higher levels of \u2018x\u2019 (height) than at lower levels. Therefore, we know that our line does not predict as well for tall people as for short people.<\/li>\n<li>Although there is a significant relationship, there is still quite a bit of unexplained variation, so height is not the only factor that affects weight of the students.<\/li>\n<\/ul>\n<\/div>\n<div><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-46.png\" alt=\"\" width=\"576\" height=\"384\" class=\"alignnone size-full wp-image-437\" srcset=\"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-46.png 576w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-46-300x200.png 300w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-46-65x43.png 65w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-46-225x150.png 225w, https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-content\/uploads\/sites\/94\/2023\/05\/Chapter-4-Image-46-350x233.png 350w\" sizes=\"auto, (max-width: 576px) 100vw, 576px\" \/><\/div>\n<div>\n<p>Figure 1. Relationship between height and weight for a group of university students taking part in an exercise on how running affects pulse rates.<\/p>\n<table>\n<tbody>\n<tr>\n<td>\n<div>\n<p><strong>How do you report results of your regression analysis?<\/strong><\/p>\n<p>Your trend statement must give the pattern from your graph, then summarize the statistical results (remember to always plot your data before doing analysis, and include a figure legend). An example from the figure above might read:<\/p>\n<p>There was a strong positive relationship between height and weight for the university students taking part in the running exercise, so that as height increased, so did weight (Figure 1, Regression analysis, r2 = 0.617). About 62% of the variation in weight was statistically explained by the variation in height, but the calculated regression line had better predictive ability at lower levels of height than for higher levels of height (Figure 1).<\/p>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div>\n<p><strong>Pulses Dataset used for analyses in this manual<\/strong>.<\/p>\n<p>A group of students were separated into 2 groups. One group ran on the spot for one minute, and the other group did not. Following the trial, information was gathered on whether they smoked, what their sex was, what their height and weight were, and what their normal activity levels were.<\/p>\n<ul>\n<li>Pulse1 = resting pulse for all.<\/li>\n<li>Pulse2 = pulse following the running. (Beats per minute)<\/li>\n<li>Weight is in pounds, Height is in inches<\/li>\n<li>ran = 1 means they ran,<\/li>\n<li>ran = 2 means they did not.<\/li>\n<li>smokes = 1 means they smoked.<\/li>\n<li>sex = 1 is male<\/li>\n<li>sex = 2 is female<\/li>\n<li>activlev is their normal activity level; 1 is slight, 2 is moderate, and 3 is high.<\/li>\n<\/ul>\n<table>\n<tbody>\n<tr>\n<td>pulse1<\/td>\n<td>pulse2<\/td>\n<td>ran<\/td>\n<td>smokes<\/td>\n<td>sex<\/td>\n<td>height<\/td>\n<td>weight<\/td>\n<td>activ.level<\/td>\n<\/tr>\n<tr>\n<td>&nbsp;<\/p>\n<p>64<\/td>\n<td>&nbsp;<\/p>\n<p>78<\/td>\n<td>&nbsp;<\/p>\n<p>1<\/td>\n<td>&nbsp;<\/p>\n<p>2<\/td>\n<td>&nbsp;<\/p>\n<p>1<\/td>\n<td>&nbsp;<\/p>\n<p>66<\/td>\n<td>&nbsp;<\/p>\n<p>140<\/td>\n<td>&nbsp;<\/p>\n<p>2<\/td>\n<\/tr>\n<tr>\n<td>58<\/td>\n<td>75<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>72<\/td>\n<td>145<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>62<\/td>\n<td>82<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>73.5<\/td>\n<td>160<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>66<\/td>\n<td>85<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>73<\/td>\n<td>190<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>64<\/td>\n<td>82<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>69<\/td>\n<td>155<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>74<\/td>\n<td>84<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>73<\/td>\n<td>165<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>84<\/td>\n<td>84<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>72<\/td>\n<td>150<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>72<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>74<\/td>\n<td>190<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>62<\/td>\n<td>75<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>72<\/td>\n<td>195<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>76<\/td>\n<td>88<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>71<\/td>\n<td>138<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>80<\/td>\n<td>104<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>74<\/td>\n<td>160<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>80<\/td>\n<td>96<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>72<\/td>\n<td>155<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>72<\/td>\n<td>88<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>70<\/td>\n<td>153<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>76<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>67<\/td>\n<td>145<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>60<\/td>\n<td>76<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>71<\/td>\n<td>170<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>62<\/td>\n<td>68<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>72<\/td>\n<td>175<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>66<\/td>\n<td>88<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>69<\/td>\n<td>175<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>70<\/td>\n<td>86<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>73<\/td>\n<td>170<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>80<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>74<\/td>\n<td>180<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>72<\/td>\n<td>80<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>66<\/td>\n<td>135<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>70<\/td>\n<td>106<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>71<\/td>\n<td>170<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>74<\/td>\n<td>76<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>70<\/td>\n<td>157<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>66<\/td>\n<td>102<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>70<\/td>\n<td>130<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>70<\/td>\n<td>98<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>75<\/td>\n<td>185<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>96<\/td>\n<td>140<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>61<\/td>\n<td>140<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>62<\/td>\n<td>100<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>66<\/td>\n<td>120<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>78<\/td>\n<td>104<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>68<\/td>\n<td>130<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>82<\/td>\n<td>100<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>68<\/td>\n<td>138<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>88<\/td>\n<td>115<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>63<\/td>\n<td>121<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>112<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>70<\/td>\n<td>125<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>96<\/td>\n<td>116<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>68<\/td>\n<td>116<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>78<\/td>\n<td>118<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>69<\/td>\n<td>145<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>88<\/td>\n<td>110<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>69<\/td>\n<td>150<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>62<\/td>\n<td>98<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>62.75<\/td>\n<td>112<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>80<\/td>\n<td>128<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>68<\/td>\n<td>125<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>62<\/td>\n<td>62<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>74<\/td>\n<td>190<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>60<\/td>\n<td>62<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>71<\/td>\n<td>155<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>72<\/td>\n<td>70<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>69<\/td>\n<td>170<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>62<\/td>\n<td>66<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>70<\/td>\n<td>155<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>76<\/td>\n<td>76<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>72<\/td>\n<td>215<\/td>\n<td>2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<table>\n<tbody>\n<tr>\n<td>pulse1<\/td>\n<td>pulse2<\/td>\n<td>ran<\/td>\n<td>smokes<\/td>\n<td>sex<\/td>\n<td>height<\/td>\n<td>weight<\/td>\n<td>activ.level<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>68<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>67<\/td>\n<td>150<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>54<\/td>\n<td>56<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>69<\/td>\n<td>145<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>74<\/td>\n<td>70<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>73<\/td>\n<td>155<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>74<\/td>\n<td>70<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>73<\/td>\n<td>155<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>68<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>71<\/td>\n<td>150<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>72<\/td>\n<td>73<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>68<\/td>\n<td>155<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>64<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>69.5<\/td>\n<td>150<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>82<\/td>\n<td>83<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>73<\/td>\n<td>180<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>64<\/td>\n<td>62<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>75<\/td>\n<td>160<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>58<\/td>\n<td>58<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>66<\/td>\n<td>135<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>54<\/td>\n<td>50<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>69<\/td>\n<td>160<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>70<\/td>\n<td>71<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>66<\/td>\n<td>130<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>62<\/td>\n<td>61<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>73<\/td>\n<td>155<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>76<\/td>\n<td>76<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>74<\/td>\n<td>148<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>88<\/td>\n<td>84<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>73.5<\/td>\n<td>155<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>70<\/td>\n<td>70<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>70<\/td>\n<td>150<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>90<\/td>\n<td>89<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>67<\/td>\n<td>140<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>78<\/td>\n<td>76<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>72<\/td>\n<td>180<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>70<\/td>\n<td>71<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>75<\/td>\n<td>190<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>90<\/td>\n<td>90<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>68<\/td>\n<td>145<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>92<\/td>\n<td>94<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>69<\/td>\n<td>150<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>60<\/td>\n<td>63<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>1<\/td>\n<td>71.5<\/td>\n<td>164<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>72<\/td>\n<td>70<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>71<\/td>\n<td>140<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>68<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>72<\/td>\n<td>142<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>84<\/td>\n<td>84<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>69<\/td>\n<td>136<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>74<\/td>\n<td>76<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>67<\/td>\n<td>123<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>66<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>68<\/td>\n<td>155<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>84<\/td>\n<td>84<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>66<\/td>\n<td>130<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>61<\/td>\n<td>70<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>65.5<\/td>\n<td>120<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>64<\/td>\n<td>60<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>66<\/td>\n<td>130<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>94<\/td>\n<td>92<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>62<\/td>\n<td>131<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>60<\/td>\n<td>66<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>62<\/td>\n<td>120<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>72<\/td>\n<td>70<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>63<\/td>\n<td>118<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>58<\/td>\n<td>56<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>67<\/td>\n<td>125<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>88<\/td>\n<td>74<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>65<\/td>\n<td>135<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>66<\/td>\n<td>72<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>66<\/td>\n<td>125<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>84<\/td>\n<td>80<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>65<\/td>\n<td>118<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>62<\/td>\n<td>66<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>65<\/td>\n<td>122<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>66<\/td>\n<td>76<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>65<\/td>\n<td>115<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>80<\/td>\n<td>74<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>64<\/td>\n<td>102<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>78<\/td>\n<td>78<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>67<\/td>\n<td>115<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>68<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>69<\/td>\n<td>150<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>72<\/td>\n<td>68<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>68<\/td>\n<td>110<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>82<\/td>\n<td>80<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>63<\/td>\n<td>116<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>76<\/td>\n<td>76<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>62<\/td>\n<td>108<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>87<\/td>\n<td>84<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>63<\/td>\n<td>95<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>90<\/td>\n<td>92<\/td>\n<td>2<\/td>\n<td>1<\/td>\n<td>2<\/td>\n<td>64<\/td>\n<td>125<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>78<\/td>\n<td>80<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>68<\/td>\n<td>133<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>68<\/td>\n<td>68<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>62<\/td>\n<td>110<\/td>\n<td>2<\/td>\n<\/tr>\n<tr>\n<td>86<\/td>\n<td>84<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>67<\/td>\n<td>150<\/td>\n<td>3<\/td>\n<\/tr>\n<tr>\n<td>76<\/td>\n<td>76<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>2<\/td>\n<td>61.75<\/td>\n<td>108<\/td>\n<td>2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n","protected":false},"author":116,"menu_order":4,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-50","chapter","type-chapter","status-publish","hentry"],"part":3,"_links":{"self":[{"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/pressbooks\/v2\/chapters\/50","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/wp\/v2\/users\/116"}],"version-history":[{"count":21,"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/pressbooks\/v2\/chapters\/50\/revisions"}],"predecessor-version":[{"id":438,"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/pressbooks\/v2\/chapters\/50\/revisions\/438"}],"part":[{"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/pressbooks\/v2\/parts\/3"}],"metadata":[{"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/pressbooks\/v2\/chapters\/50\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/wp\/v2\/media?parent=50"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/pressbooks\/v2\/chapter-type?post=50"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/wp\/v2\/contributor?post=50"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.library.upei.ca\/bio3310\/wp-json\/wp\/v2\/license?post=50"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}