帮我看看这个logistic regression output包子谢 - Statistics版 - 未名存档

本页内容为未名空间相应帖子的节选和存档，一周内的贴子最多显示50字，超过一周显示500字访问原贴

Statistics版 - 帮我看看这个logistic regression output包子谢

相关主题
● 弱问一个SAS里面求adjusted means的问题	● ##面试过了，请教问题##
● ks 只有28%	● model和variables都sig.但每个category都不sig
● 请教如何用R做Cox model的k-fold cross-validation	● 请教ㄧ个modeling问题
● 紧急求助，logistic regression	● any regression model with high prediction accuracy?
● 急问：用stata或R算predicted probabiltiy (logistic regressi	● [急求助] survival analysis （SAS）
● 做logistic regression，cases很少但是predictor很多	● 关于stepwise programming
● 抓狂！为啥选出来的predictor都这么差	● 请问sas中一个变量的内容被两个左斜杠（/）分成了三部分
● 请教如何分析一个case control study。	● 怎样用R定位变量的位置

相关话题的讨论汇总
话题: empty话题: obs话题: predicts话题: perfectly话题: note

进入Statistics版参与讨论

1

(共1页)

p********1 发帖数: 1011	1 一共6个independent variables。从这个output上看，哪些variables是好的 predictors? 开始的那些note“!= 0 predicts failure perfectly” 是什么意思？怎么解决？ logistic y i.var1 i.var2 i.var3 i.expect i.pwar i.open note: 1.var1 != 0 predicts failure perfectly 1.var1 dropped and 14539 obs not used note: 2.var1 != 0 predicts failure perfectly 2.var1 dropped and 4903 obs not used note: 4.var2 != 0 predicts failure perfectly 4.var2 dropped and 96 obs not used note: 4.var3 != 0 predicts failure perfectly 4.var3 dropped and 235 obs not used note: 5.var1 omitted because of collinearity Logistic regression Number of obs = 28997 LR chi2(11) = 59.25 Prob > chi2 = 0.0000 Log likelihood = -1125.0894 Pseudo R2 = 0.0257 ------------------------------------------------------------ y \| Odds Ratio P>\|z\| [95% Conf. Interval] -------------+---------------------------------------------- var1 \| 1 \| 1 (empty) 2 \| 1 (empty) 3 \| 8.325762 0.081 .7696349 90.0665 4 \| 1.766659 0.595 .2168463 14.39307 5 \| 1 (omitted) \| var2 \| 2 \| .1553978 0.032 .0282502 .8548058 3 \| 1.110723 0.673 .6820567 1.808801 4 \| 1 (empty) 5 \| 1.190851 0.833 .2353377 6.025916 \| var3 \| 2 \| 6.190578 0.023 1.287397 29.768 3 \| .9674644 0.836 .7076692 1.322634 4 \| 1 (empty) 5 \| 18.59003 0.000 3.944853 87.60513 \| 1.expect \| 1.710305 0.001 1.255682 2.329524 1.pwar \| .7200235 0.039 .5271828 .9834044 1.open \| .9907067 0.959 .694594 1.413055 _cons \| .0009225 0.000 .0000842 .0101118 -----------------------------------------------------------
D*G 发帖数: 471	2 哇我又来赚包子了。前面那个是说var1和你的response variable完全相关。 Var1>0的时候response一直都是0。 Google “complete separation”。 var1应该扔掉。后面太乱了没法看。【在 p********1 的大作中提到】 : 一共6个independent variables。从这个output上看，哪些variables是好的 : predictors? : 开始的那些note“!= 0 predicts failure perfectly” : 是什么意思？怎么解决？ : logistic y i.var1 i.var2 i.var3 i.expect i.pwar i.open : note: 1.var1 != 0 predicts failure perfectly : 1.var1 dropped and 14539 obs not used : note: 2.var1 != 0 predicts failure perfectly : 2.var1 dropped and 4903 obs not used : note: 4.var2 != 0 predicts failure perfectly
p********1 发帖数: 1011	3 又是你啊，谢谢！我重新复制了一遍表格，把standard error和z两栏删了，就剩下 odds ratio, p, confidence interval了。（不过之前的table还在底下删不掉）。能帮我看看这个table，哪些variables是predictors? 为什么？（看哪个参数得出的结论？）【在 D*G 的大作中提到】 : 哇我又来赚包子了。 : 前面那个是说var1和你的response variable完全相关。 Var1>0的时候response一直都 : 是0。 Google “complete separation”。 var1应该扔掉。 : 后面太乱了没法看。
p********1 发帖数: 1011	4 Logistic regression Number of obs = 28997 LR chi2(11) = 59.25 Prob > chi2 = 0.0000 Log likelihood = -1125.0894 Pseudo R2 = 0.0257 ------------------------------------------------------------ y \| Odds Ratio P>\|z\| [95% Conf. Interval] -------------+---------------------------------------------- var1 \| 1 \| 1 (empty) 2 \| 1 (empty) 3 \| 8.325762 0.081 .7696349 90.0665 4 \| 1.766659 0.595 .2168463 14.39307 5 \| 1 (omitted) \| var2 \| 2 \| .1553978 0.032 .0282502 .8548058 3 \| 1.110723 0.673 .6820567 1.808801 4 \| 1 (empty) 5 \| 1.190851 0.833 .2353377 6.025916 \| var3 \| 2 \| 6.190578 0.023 1.287397 29.768 3 \| .9674644 0.836 .7076692 1.322634 4 \| 1 (empty) 5 \| 18.59003 0.000 3.944853 87.60513 \| 1.expect \| 1.710305 0.001 1.255682 2.329524 1.pwar \| .7200235 0.039 .5271828 .9834044 1.open \| .9907067 0.959 .694594 1.413055 _cons \| .0009225 0.000 .0000842 .0101118 -----------------------------------------------------------
D*G 发帖数: 471	5 不太熟悉spss的output。。我觉得你需要先解决complete separation的问题再看结果。
p********1 发帖数: 1011	6 这是stata的output，不是spss的。我的问题还是：当sample size大的时候，到底是看哪个参数：odds ratio, p, confidence interval？分别怎么看（eg. cutoff)？果。【在 D*G 的大作中提到】 : 不太熟悉spss的output。。我觉得你需要先解决complete separation的问题再看结果。
D*G 发帖数: 471	7 我觉得固定的strategy是看每个variable的overall p-value。剩下的都是具体问题具体分析。如果你有太多var都significant，就有可能是sample size太大，导致很小的差异也被 detect到了。如果你的model不关心很小的差异，这时候你就可以考虑只留下category之间差异比较明显的variable，也就是OR比较大的。可以把简化的model和未简化的model用 likelihood ratio test来比较。如果差异不大，就可以用简化的model。你再听听其他人意见吧，这是我知道的，说不定其他人意见不一样。【在 p********1 的大作中提到】 : 这是stata的output，不是spss的。 : 我的问题还是：当sample size大的时候，到底是看哪个参数：odds ratio, p, : confidence interval？ : 分别怎么看（eg. cutoff)？ : : 果。
y*****w 发帖数: 1350	8 "可以把简化的model和未简化的model用 likelihood ratio test来比较。如果差异不大，就可以用简化的model。" --- How to define "差异不大"? Do those likelihood ratio tests have to strictly follow the Chi-Square probabilities? For example, if a likelihood ratio test has a p-value of 0.045, should the full model still be applied? 【在 D*G 的大作中提到】 : 我觉得固定的strategy是看每个variable的overall p-value。剩下的都是具体问题具 : 体分析。 : 如果你有太多var都significant，就有可能是sample size太大，导致很小的差异也被 : detect到了。如果你的model不关 : 心很小的差异，这时候你就可以考虑只留下category之间差异比较明显的variable，也 : 就是OR比较大的。 : 可以把简化的model和未简化的model用 likelihood ratio test来比较。如果差异不 : 大，就可以用简化的model。 : 你再听听其他人意见吧，这是我知道的，说不定其他人意见不一样。
p********1 发帖数: 1011	9 谢谢回复。包子已发！【在 D*G 的大作中提到】 : 我觉得固定的strategy是看每个variable的overall p-value。剩下的都是具体问题具 : 体分析。 : 如果你有太多var都significant，就有可能是sample size太大，导致很小的差异也被 : detect到了。如果你的model不关 : 心很小的差异，这时候你就可以考虑只留下category之间差异比较明显的variable，也 : 就是OR比较大的。 : 可以把简化的model和未简化的model用 likelihood ratio test来比较。如果差异不 : 大，就可以用简化的model。 : 你再听听其他人意见吧，这是我知道的，说不定其他人意见不一样。

1

(共1页)

进入Statistics版参与讨论

相关主题
● 怎样用R定位变量的位置	● 急问：用stata或R算predicted probabiltiy (logistic regressi
● 怎样用R除掉DUPLICATED RECORD	● 做logistic regression，cases很少但是predictor很多
● [合集] └ Re: 关于stepwise programming	● 抓狂！为啥选出来的predictor都这么差
● Ask 2 simple SAS questions,thanks	● 请教如何分析一个case control study。
● 弱问一个SAS里面求adjusted means的问题	● ##面试过了，请教问题##
● ks 只有28%	● model和variables都sig.但每个category都不sig
● 请教如何用R做Cox model的k-fold cross-validation	● 请教ㄧ个modeling问题
● 紧急求助，logistic regression	● any regression model with high prediction accuracy?

相关话题的讨论汇总
话题: empty话题: obs话题: predicts话题: perfectly话题: note

未名新帖统计// 7月16日

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

* 这里只显示发帖超过25的版面，努力灌水吧:-)