弱问，为啥取了log之后得到相反结论 - Statistics版 - 未名存档

本页内容为未名空间相应帖子的节选和存档，一周内的贴子最多显示50字，超过一周显示500字访问原贴

Statistics版 - 弱问，为啥取了log之后得到相反结论

相关主题
● Maximum Likelihood Estimator	● 请问：bootstrap的应用范围
● 统计的学问 (转载)	● 谁给解释一下“sampling distribution”
● Regression model 不用 test normality？	● 诚心请教一个统计问题
● 问一个关于skewness的问题	● 请教：怎么求几组样本的总体均值的方差？
● 一道概率题	● 请教：怎么求几组样本的总体方差？
● 求助：为什么box-cox transformed 后的varibale 不normal	● 请教：一个总体的两个独立组取样后，总体的方差怎么求？
● 请问各位能人一个data transformation的问题	● 问2个统计检验的问题
● 问题请教: 如果样本数量是3, 怎么做test?	● 请教如何解这个qualifying exam 问题

相关话题的讨论汇总
话题: log话题: mu1话题: normal话题: mu2

进入Statistics版参与讨论

1

(共1页)

s********u 发帖数: 344	1 比较两个样本的均值。样本说大不大，说小不小， 1000左右，完全不是正态分布。因为样本还算大，能够用上central limit theorem 吧。那就z检验吧，得到两个样本均值是不同的结论 p=0.014。没事有试了一下log transformation，看上去正态多了，结果得到了p=0.24。也看过有些文献说做了transformation很多统计量会变，但是这样的结果我该怎么解释呐？
I*****a 发帖数: 5425	2 Maybe because the distribution of means are not normal enough ? How about looking at the their distributions using bootstrap. 【在 s********u 的大作中提到】 : 比较两个样本的均值。样本说大不大，说小不小， 1000左右，完全不是正态分布。因 : 为样本还算大，能够用上central limit theorem 吧。那就z检验吧，得到两个样本均 : 值是不同的结论 p=0.014。 : 没事有试了一下log transformation，看上去正态多了，结果得到了p=0.24。 : 也看过有些文献说做了transformation很多统计量会变，但是这样的结果我该怎么解释 : 呐？
s*r 发帖数: 2757	3 log transformation actually checks whether the ratio of 2 geometric mean is 1. i guess you will get similar p if you use permutation test to get an empirical null 【在 s********u 的大作中提到】 : 比较两个样本的均值。样本说大不大，说小不小， 1000左右，完全不是正态分布。因 : 为样本还算大，能够用上central limit theorem 吧。那就z检验吧，得到两个样本均 : 值是不同的结论 p=0.014。 : 没事有试了一下log transformation，看上去正态多了，结果得到了p=0.24。 : 也看过有些文献说做了transformation很多统计量会变，但是这样的结果我该怎么解释 : 呐？
r********2 发帖数: 19	4 既然用了clt，我们应该不需要原分布是normal的吧。这个是应该可以预计到的结果。log transformation 对不同值转换的效果不一样，在 >1的range 里，使得两个数的距离变小。所以变得不显著了啊。但是两个test是在检验不同的mean。 (司徒丢丢) 的大作中提到: 】
P****D 发帖数: 11146	5 因为取对数之后你检验的是两个样本均值的商，这跟你检验两个样本均值的差，能一样嘛。
a*****3 发帖数: 601	6 correct 【在 P****D 的大作中提到】 : 因为取对数之后你检验的是两个样本均值的商，这跟你检验两个样本均值的差，能一样 : 嘛。
j*****e 发帖数: 182	7 Based on what you said "没事有试了一下log transformation，看上去正态多了，结果得到了p=0.24。" log_normal can be used to model the two distributions. Let's say the first distribution is log_normal (mu1, sigma1), the second distribution is log_normal(mu2,sigma2). Since your p-value is large (0.24), mu1 is not sig. different from mu2. The mean of the fist log_normal is exp(mu1+0.5sigma12). The mean of the second log_normal is exp(mu2+0.5sigma**2). You said "那就z检验吧，得到两个样本均值是不同的结论 p=0.014。 This implies sigma1 is not equivalent to sigma2. In summary, you have two distributions (possible positive distributions) skewed to the right. They have the same median (exp(mu1)=exp(mu2)). Their means are different, because the scale parameters on the log_scale are different. There is nothing wrong the CLT.
h****i 发帖数: 79	8 Good answer. Mark! 【在 j****e 的大作中提到】 : Based on what you said "没事有试了一下log transformation，看上去正态多了，结 : 果得到了p=0.24。" : log_normal can be used to model the two distributions. : Let's say the first distribution is log_normal (mu1, sigma1), the second : distribution is log_normal(mu2,sigma2). : Since your p-value is large (0.24), mu1 is not sig. different from mu2. : The mean of the fist log_normal is exp(mu1+0.5sigma1*2). The mean of the : second log_normal is exp(mu2+0.5sigma**2). : You said "那就z检验吧，得到两个样本均值是不同的结论 p=0.014。 : This implies sigma1 is not equivalent to sigma2.
z******n 发帖数: 397	9 有道理，不过因为log_normal的median是exp(mu)，所以从检验mu1=mu2的目标来看，可以用非参数检验来test变换过后data的median。对变换过后的data用z检验，零假设就变了。【在 j****e 的大作中提到】 : Based on what you said "没事有试了一下log transformation，看上去正态多了，结 : 果得到了p=0.24。" : log_normal can be used to model the two distributions. : Let's say the first distribution is log_normal (mu1, sigma1), the second : distribution is log_normal(mu2,sigma2). : Since your p-value is large (0.24), mu1 is not sig. different from mu2. : The mean of the fist log_normal is exp(mu1+0.5sigma1*2). The mean of the : second log_normal is exp(mu2+0.5sigma**2). : You said "那就z检验吧，得到两个样本均值是不同的结论 p=0.014。 : This implies sigma1 is not equivalent to sigma2.

1

(共1页)

进入Statistics版参与讨论

相关主题
● 请教如何解这个qualifying exam 问题	● 一道概率题
● [合集] 请教关于大样本下分布的比较的问题	● 求助：为什么box-cox transformed 后的varibale 不normal
● 请牛人帮帮忙	● 请问各位能人一个data transformation的问题
● a simulation problem	● 问题请教: 如果样本数量是3, 怎么做test?
● Maximum Likelihood Estimator	● 请问：bootstrap的应用范围
● 统计的学问 (转载)	● 谁给解释一下“sampling distribution”
● Regression model 不用 test normality？	● 诚心请教一个统计问题
● 问一个关于skewness的问题	● 请教：怎么求几组样本的总体均值的方差？

相关话题的讨论汇总
话题: log话题: mu1话题: normal话题: mu2

未名新帖统计// 7月16日

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

* 这里只显示发帖超过25的版面，努力灌水吧:-)