Содержание
- 2. Causation
- 3. Causation Causation is any cause that produces an effect. This means that when something happens (cause)
- 4. Correlation Correlation measures the relationship between two things. Positive correlations happen when one thing goes up,
- 5. Correlation Correlations happen when: A causes B B causes A A and B are consequences of
- 6. Causation and Correlation Causation and correlation can happen at the same time. But having a correlation
- 7. Correlation or Causation? As people’s happiness level increases, so does their helpfulness. This would be a
- 8. Correlation or Causation? Dogs pant to cool themselves down. This would be a causation. When a
- 9. Correlation or Causation? Among babies, those who are held more tend to cry less. This would
- 10. Let's think of our own Correlation: Causation:
- 11. Quick Review Causation is any cause that produces an effect. Correlation measure the relationship between two
- 12. Correlation
- 13. The Question Are two variables related? Does one increase as the other increases? e. g. skills
- 14. Scatterplots Graphically depicts the relationship between two variables in two dimensional space.
- 15. Direct Relationship
- 16. Inverse Relationship
- 17. An Example Does smoking cigarettes increase systolic blood pressure? Plotting number of cigarettes smoked per day
- 18. Trend?
- 19. Smoking and BP Note relationship is moderate, but real. Why do we care about relationship? What
- 20. Heart Disease and Cigarettes Data on heart disease and cigarette smoking in 21 developed countries Data
- 21. The Data Surprisingly, the U.S. is the first country on the list--the country with the highest
- 22. Scatterplot of Heart Disease CHD Mortality goes on Y axis Why? Cigarette consumption on X axis
- 23. {X = 6, Y = 11}
- 24. What Does the Scatterplot Show? As smoking increases, so does coronary heart disease mortality. Relationship looks
- 25. Correlation Co-relation The relationship between two variables Measured with a correlation coefficient Most popularly seen correlation
- 26. Types of Correlation Positive correlation High values of X tend to be associated with high values
- 27. Correlation Coefficient A measure of degree of relationship. Between 1 and -1 Sign refers to direction.
- 29. Covariance The formula for co-variance is: How this works, and why? When would covXY be large
- 30. Example
- 31. Example What the heck is a covariance? I thought we were talking about correlation?
- 32. Correlation Coefficient Pearson’s Product Moment Correlation Symbolized by r Covariance ÷ (product of the 2 SDs)
- 33. Calculation for Example CovXY = 11.12 sX = 2.33 sY = 6.69
- 34. Example Correlation = .713 Sign is positive Why? If sign were negative What would it mean?
- 35. Factors Affecting r Range restrictions Looking at only a small portion of the total scatter plot
- 36. Factors Affecting r Outliers Overestimate Correlation Underestimate Correlation
- 37. Countries With Low Consumptions
- 38. Outliers
- 39. Testing Correlations So you have a correlation. Now what? In terms of magnitude, how big is
- 40. Regression
- 41. „Regression” refers to the process of fitting a simple line to datapoints, Historically, linear regression was
- 42. What is regression? How do we predict one variable from another? How does one variable change
- 43. Linear Regression A technique we use to predict the most likely score on one variable from
- 44. Linear Regression: Parts Y - the variables you are predicting i.e. dependent variable X - the
- 45. Why Do We Care? We may want to make a prediction. More likely, we want to
- 46. An Example Cigarettes and CHD Mortality again Data repeated on next slide We want to predict
- 47. The Data Based on the data we have what would we predict the rate of CHD
- 48. For a country that smokes 6 C/A/D… We predict a CHD rate of about 14 Regression
- 49. Regression Line Formula = the predicted value of Y (e.g. CHD mortality) X = the predictor
- 50. Regression Coefficients “Coefficients” are a and b b = slope Change in predicted Y for one
- 51. Calculation Slope Intercept
- 52. For Our Data CovXY = 11.12 s2X = 2.332 = 5.447 b = 11.12/5.447 = 2.042
- 53. Note: The values we obtained are shown on printout. The intercept is the value in the
- 54. Making a Prediction Second, once we know the relationship we can predict We predict 22.77 people/10,000
- 55. Accuracy of Prediction Finnish smokers smoke 6 C/A/D We predict: They actually have 23 deaths/10,000 Our
- 56. Cigarette Consumption per Adult per Day 12 10 8 6 4 2 CHD Mortality per 10,000
- 57. Residuals When we predict Ŷ for a given X, we will sometimes be in error. Y
- 58. Minimizing Residuals Again, the problem lies with this definition of the mean: So, how do we
- 60. Скачать презентацию