two_basic_statistics

	PLS 401 Senior Seminar in Political Science Roger C. Lowery, Ph.D.	Two Basic Summary Statistics

I. Statistical association (or correlation) measures the strength (or weakness) of the relationship between two variables for cross tabulations, we will always use the correlation coefficient (can vary between 0 to ±1) a positive (or direct) correlation indicates that as the values of one variable increase, so do the values of the other variable a negative (or inverse) correlation indicates that as the values of one variable increase, the values of the other variable decrease the closer either correlation coefficient is to zero, the weaker the relationship in social-science research, the following classification of correlation coefficients is commonly used: a weak relationship is present if Tau-B is less than plus or minus 0.10 a moderate relationship is present if Tau-B is between plus or minus 0.10 and 0.25 a strong relationship is present if Tau-B is greater than plus or minus 0.25 statistical association is not necessarily the same thing as causation i.e., just because there is present a weak, moderate, or strong level of statistical association between two variables does not necessarily mean that changes in one variable cause changes observed in the other variable statistical association just means that the values of one variable change consistently with changes in the values of the other variable it may be a third variable that accounts for (or causes) the changes in the two variables that you are measuring II. Statistical significance measures the probability of random-sampling error in survey data random-sampling error causes the observed patterns that we see in sample data to be unrepresentative of the actual pattern that we would see if we had looked at the whole population from which the sample was randomly drawn the good news with random sampling is that we can accurately estimate population patterns by looking at only a relatively small sample randomly drawn from a much larger population the bad news is that any randomly-drawn sample can be unrepresentative of the population from which it is drawn however, the good news is that the probability of having an unrepresentative sample can be calculated (this is what the Chi-square statistic tells us) in this course, we will use the Chi-square statistic to determine the probability of random-sampling error in social-science research, the following classification of Chi-square probability is commonly used: if the Chi-square probability of random-sample error is less than 0.05, then the sample results are assumed to be statistically significant (i.e., there is less than a 5% chance that our observed sample results are unrepresentative of the population results) if the Chi-square probability of random-sample error is equal to or greater than 0.05, then the sample results are assumed to be *statistically not* significant (i.e., there is a 5% chance or more that our observed sample results are misleading us) the Chi-square probability** is reported one of two ways: sometimes an asterisk is used if there is an asterisk next to the correlation coefficient, then the sample results are considered statistically significant if there is no asterisk, then the sample results are considered statistically not significant or, sometimes the actual Chi-square probability is given if the Chi-square probability is less than 0.05, then the sample results are considered statistically significant, if the Chi-square probability is equal to or greater than 0.05, then the sample results are considered *statistically not* significant, statistical significance** is not the same thing as substantive significance: for example, national-sample surveys repeatedly show that there is no statistically significant difference between Protestants (as a group) and Catholics (as a group) in their attitudes toward legalizing abortion as we can see in the 2004 NES survey results in the table below, roughly the same proportion of Catholics as Protestants support legalizing abortion); the relationship is weak (the Tau-B is less than 0.10) and any sample differences are not statistically significant (there is no asterisk, indicating that the Chi-square probability of random sampling error is equal to or greater than 5% or 0.05 -- the Chi-square probability for this cross tabulation is actually 0.38) such a finding (a weak correlation that is not statistically significant) is nevertheless substantively significant, given the strong anti-abortion stand taken by the Roman Catholic Church's leadership.
ATTITUDES TOWARD LEGALIZING ABORTION by RELIGIOUS IDENTIFICATION Tau-B: 0.091; Chi-square prob.: 0.383 Protestant Catholic Missing TOTAL OUTLAW ALL ABORTIONS 98 36 5 134 16.8% 14.1% 16.0% LIMIT SOME ABORTIONS 310 132 75 442 53.1% 51.6% 52.6% ALLOW ABORTION FOR ANY REASON 176 88 127 264 30.1% 34.4% 31.4% Missing 88 36 41 156 TOTAL 584 256 239 840 100.0% 100.0% 100.0%