- 1、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。。
- 2、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
生物统计13相关与回归分析
醫資系 Lecture#13 p. PAGE 14
生物統計學 96-1 200
生物統計學13:相關係數與回歸分析
0
0. Definition
Before we conduct any type of analysis, we should always create a two-way scatter plot of the data. Correlation is the quantification strength of linear association among continuous random variables.
1. Pearson’s Correlation Coefficient
- The linear relationship between two r.v.s X and Y is denote byρ(rho), which is the average of the product of the standard normal deviates of X and Y.
- The estimator of the population correlation is known as Pearson’s coefficient of correlation, which is denoted by r.
ρ= E {[(X –μX )/σX][(Y –μY )/σY]}, population correlation
r = =, sample correlation
=
- Note that, -1≤ r ≤ 1, and r = 1 or -1 when there is an exact linear relationship between x and y. As the relationship deviates from perfect linearity, r moves away from ±1 and closer to 0.
- If y increases as x increases, r 0, positively correlated;
if y decreases as x increases, r 0, negatively correlated.
(1) Linear relationship, invalid for a nonlinear relationship.
(2) Highly sensitive to outliers, extreme values.
(3) Not to be extrapolated beyond the observed ranges of the variables.
(4) The correlation coefficient merely tells us that a linear relationship exists; it does not specify whether the relationship is cause-and-effect.
1.1 Hypothesis Testing
- Evaluate whether any correlation exists between the r.v.s X and Y;
i.e., testing H0 :ρ= 0 (there is no association) vs H1 :ρ≠ 0.
Test statistic T = = ,
where the estimated standard error of r is .
- Under H0, the quantity T ~ t(n – 2), if X and Y are normally distributed.
(Valid only for testing H0 :ρ= 0, not for H0 :ρ=ρ0 → sampling dist of r is skewed)
2. Spearman’s Rank Correlation Coefficient
- Pearson’s correlation is sensitive to outliers, like other parametric techniques;
- Spearman’s rank correlation calculates the correlation based on the ranked values of x and y a nonparametric method.
- A measure of concordance of the ranks for the o
文档评论(0)