Regression with an AR(1) noise term: how to do MLE or another fit? - DataSciences board - 未名存档

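c*******e (posts: 150)	#1
[Reposted from the Statistics board]
From: cavaliere (Un Baiser S'il Vous Plaît), Board: Statistics
Subject: Regression with an AR(1) noise term: how to do MLE or another fit?
Posted at: BBS 未名空间站 (Mon Sep 15 22:02:42 2014, US Eastern)

I would like to ask the experts on this board: if the noise term in a linear regression is an AR(1) process, what established algorithms are there for MLE or other ways to fit it?

Specifically, the model is

Y(t) = X(t) \cdot \beta + E(t),

where X(t) and \beta are K-dimensional vectors and everything else is scalar. t = 1, 2, 3, ..., T is the sample at hand. Unlike classical linear regression, E(t) is not i.i.d. Gaussian white noise; instead, assume E(t) follows the model

E(t) = \rho * E(t-1) + \sigma * Z(t),

where \rho and \sigma are unknown parameters and Z(t) can be taken to be Gaussian white noise. So the full set of parameters consists of the vector \beta and the scalars \sigma and \rho.

A maximum-likelihood method would be best, because that keeps a log-likelihood ratio test available later for model comparison/selection.

I did a quick Google search and literature survey but, perhaps because I used the wrong keywords, found nothing useful -_-

Thanks in advance to anyone kind enough to give me some pointers!
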
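For reference, the exact Gaussian log-likelihood of this model can be written down in closed form. The expression below is a sketch under two assumptions that the post does not state: |\rho| < 1, and a stationary start, E(1) ~ N(0, \sigma^2 / (1 - \rho^2)). The shorthand e_t = Y(t) - X(t) \cdot \beta is introduced here only for readability.

```latex
\ell(\beta,\rho,\sigma)
  = -\frac{T}{2}\log\!\left(2\pi\sigma^{2}\right)
    + \frac{1}{2}\log\!\left(1-\rho^{2}\right)
    - \frac{1}{2\sigma^{2}}
      \left[(1-\rho^{2})\,e_{1}^{2}
            + \sum_{t=2}^{T}\left(e_{t}-\rho\,e_{t-1}\right)^{2}\right],
\qquad e_{t} = Y(t) - X(t)\cdot\beta .
```

Dropping the two terms that involve the first observation gives the conditional likelihood, whose maximization reduces to least squares on the quasi-differenced data.
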
Y****a (posts: 243)	#2
Y(t) - rho Y(t-1) = beta (X(t) - rho X(t-1)) + e, where e is iid normal(0, sigma^2).
Apply the EM algorithm to estimate beta and rho:
1. Start from rho = 0 => beta(hat).
2. Plug in beta(hat), transform your Y and X, and estimate rho(hat).
3. Repeat steps 1 & 2 until convergence.

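The three steps above can be sketched in Python with NumPy. This alternating scheme is commonly known as the iterated Cochrane-Orcutt procedure; it maximizes the likelihood conditional on the first observation rather than the exact likelihood. The function name `cochrane_orcutt`, the tolerance, and the simulated data are illustrative assumptions, not something given in the thread.

```python
import numpy as np

def cochrane_orcutt(y, X, tol=1e-10, max_iter=200):
    """Alternate between beta | rho and rho | beta until rho stops changing."""
    rho = 0.0  # step 1 starts from rho = 0, i.e. plain OLS
    for _ in range(max_iter):
        # beta given rho: OLS of Y(t) - rho*Y(t-1) on X(t) - rho*X(t-1)
        y_star = y[1:] - rho * y[:-1]
        X_star = X[1:] - rho * X[:-1]
        beta, *_ = np.linalg.lstsq(X_star, y_star, rcond=None)
        # rho given beta: regress the residual E(t) on E(t-1)
        e = y - X @ beta
        rho_new = (e[1:] @ e[:-1]) / (e[:-1] @ e[:-1])
        converged = abs(rho_new - rho) < tol
        rho = rho_new
        if converged:
            break
    # innovation standard deviation from the quasi-differenced residuals
    innov = e[1:] - rho * e[:-1]
    sigma = np.sqrt(innov @ innov / len(innov))
    return beta, rho, sigma

# Simulated example (all numbers below are made up for illustration).
rng = np.random.default_rng(0)
T = 2000
X = np.column_stack([np.ones(T), rng.normal(size=T)])
beta_true, rho_true, sigma_true = np.array([1.0, 2.0]), 0.6, 0.5
noise = np.zeros(T)
noise[0] = rng.normal(scale=sigma_true / np.sqrt(1 - rho_true**2))
for t in range(1, T):
    noise[t] = rho_true * noise[t - 1] + sigma_true * rng.normal()
y = X @ beta_true + noise

beta_hat, rho_hat, sigma_hat = cochrane_orcutt(y, X)
print(beta_hat, rho_hat, sigma_hat)
```

With a sample this long the estimates should land near the simulated values; on short samples the dropped first observation can matter more.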
s*********i (posts: 218)	#3
Try the Dynamic Regression function in R.

c*******e (posts: 150)	#4
Awesome. After doing a further survey on this topic, I also think this is the best solution.

Out of curiosity, may I ask a further question: given the sample X(t) and Y(t), suppose that beta_star(rho_star) maximizes the likelihood function over all beta given rho == rho_star, and rho_star(beta_star) maximizes the likelihood function over all rho given beta == beta_star; that is, this pair of beta_star(rho_star) and rho_star(beta_star) is the fixed point we converge to in step (3). Is there any theoretical guarantee that this pair [beta_star, rho_star] is the global maximum-likelihood estimator (MLE)? Or could there be counter-examples with a gap from the global maximum, so that we need to be careful when applying properties of the MLE to the obtained estimators?

Thanks very much!

[Quoting Y****a:]
: Y(t) - rho Y(t-1) = beta (X(t) - rho X(t-1)) + e
: where e is iid normal(0, sigma^2)
: apply EM algorithm to estimate beta and rho.
: 1. initial value rho = 0 => beta(hat)
: 2. plug in beta(hat), transform your Y and X, estimate rho(hat)
: 3. repeat steps 1 & 2 until converge.

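One practical check on this question (not a proof) is to profile the likelihood over rho: for each fixed rho the maximizing beta and sigma are available in closed form, so the whole problem reduces to a one-dimensional curve that can be scanned on a grid. A fixed point of an alternating scheme is in general only guaranteed to be a stationary point, so comparing it against the grid maximum shows whether a gap exists on a given data set. The sketch below assumes |rho| < 1 and a stationary first observation (the Prais-Winsten transformation); `profile_loglik`, the grid, and the simulated data are illustrative assumptions.

```python
import numpy as np

def profile_loglik(rho, y, X):
    """Exact Gaussian log-likelihood at a fixed rho, maximized over beta and sigma."""
    T = len(y)
    w = np.sqrt(1.0 - rho**2)
    # Prais-Winsten transform: rescale the first row, quasi-difference the rest
    y_star = np.concatenate([[w * y[0]], y[1:] - rho * y[:-1]])
    X_star = np.vstack([w * X[:1], X[1:] - rho * X[:-1]])
    beta, *_ = np.linalg.lstsq(X_star, y_star, rcond=None)
    resid = y_star - X_star @ beta
    sigma2 = resid @ resid / T  # ML estimate of sigma^2 at this rho
    ll = (-0.5 * T * (np.log(2 * np.pi * sigma2) + 1.0)
          + 0.5 * np.log(1.0 - rho**2))
    return ll, beta, sigma2

# Simulated example (all numbers below are made up for illustration).
rng = np.random.default_rng(1)
T = 2000
X = np.column_stack([np.ones(T), rng.normal(size=T)])
noise = np.zeros(T)
noise[0] = rng.normal(scale=0.5 / np.sqrt(1 - 0.6**2))
for t in range(1, T):
    noise[t] = 0.6 * noise[t - 1] + 0.5 * rng.normal()
y = X @ np.array([1.0, 2.0]) + noise

rhos = np.linspace(-0.99, 0.99, 199)  # grid step of 0.01
lls = np.array([profile_loglik(r, y, X)[0] for r in rhos])
best_rho = rhos[np.argmax(lls)]
# interior grid points that beat both neighbours; more than one would be a warning sign
n_local_max = int(np.sum((lls[1:-1] > lls[:-2]) & (lls[1:-1] > lls[2:])))
print(best_rho, n_local_max)
```

If the fixed point from the alternating scheme sits far from `best_rho`, or the grid shows several local maxima, the asymptotic properties of the MLE should not be applied to it without further checks.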
Y****a (posts: 243)	#5
I remember there was a proof that, under certain conditions, the algorithm reaches the global MLE. But I forgot what the conditions were :(

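h*****7 (posts: 6781)	#6
To the best of my knowledge, there is no theoretical result on the probability that EM reaches the global optimum.

Keep one thing in mind: methods like EM do not belong to probability theory proper but to stochastic processes, because they use indicators (see texts on stochastic processes, computational methods, introduction to algorithms and the like), so it is hard to give a tight theoretical bound or a probability of reaching the optimum. The usual statement is that they get arbitrarily close to the optimum. In the GMM setting, though, people have systematically tested how far the EM solution is from the optimum.

Also, MLE has no real justification to begin with; it is what people colloquially call "where you sit decides what you think". Even if you find the global optimum, it is still an imperfect solution for parameter estimation, and Wilks' theorem, which you would rely on afterwards, is only asymptotic as well, so LZ (the original poster) should not set the bar too high.

In LZ's case, time-domain or frequency-domain analysis cannot be used, so EM is a fairly good choice.

Damn, I wrote a whole pile and, looking back, none of it helps LZ much. YueJia, who I suspect is an old alt account of mine, explained it better.
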
l*******m (posts: 1096)	#7
Kalman filter. It is just the EM algorithm. It has been studied for decades.

[Quoting c*******e:]
: Awesome. Upon doing further survey on this topic, I also think this is the
: best solution.
: Out of curiosity, may I ask a further question:
: given the sample X(t) and Y(t), suppose that beta_star(rho_star) maximized
: the likelihood function of all beta given that rho == rho_star, and
: rho_star(beta_star) maximized the likelihood function of all rho given that
: beta == beta_star, namely this pair of beta_star(rho_star) and
: rho_star(beta_star) is the fixed-point which we converged at step (3), is
: there
: any theoretical guarantee that this pair [beta_star, rho_star] is the global
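
As a rough illustration of the state-space view mentioned here: if the AR(1) disturbance E(t) is treated as a scalar hidden state observed without measurement noise, a Kalman filter evaluates the exact Gaussian likelihood through the prediction error decomposition. The sketch below only computes the likelihood at given parameter values and checks it against the closed-form expression; it does not implement EM. It assumes |rho| < 1 and a stationary initial state, and the names `kalman_loglik` and `closed_form_loglik` as well as the data are hypothetical.

```python
import numpy as np

def kalman_loglik(beta, rho, sigma, y, X):
    """Exact log-likelihood via a scalar Kalman filter on the state E(t)."""
    u = y - X @ beta               # with beta fixed, the state E(t) is observed as u(t)
    a = 0.0                        # predicted state mean for t = 1
    P = sigma**2 / (1 - rho**2)    # predicted state variance (stationary start)
    ll = 0.0
    for ut in u:
        v, F = ut - a, P           # innovation and its variance (no measurement noise)
        ll += -0.5 * (np.log(2 * np.pi * F) + v**2 / F)
        a_filt = a + (P / F) * v   # update step
        P_filt = P - P**2 / F
        a = rho * a_filt           # prediction step for t + 1
        P = rho**2 * P_filt + sigma**2
    return ll

def closed_form_loglik(beta, rho, sigma, y, X):
    """Same likelihood written out directly, used here only as a cross-check."""
    u = y - X @ beta
    T = len(u)
    ss = (1 - rho**2) * u[0]**2 + np.sum((u[1:] - rho * u[:-1])**2)
    return (-0.5 * T * np.log(2 * np.pi * sigma**2)
            + 0.5 * np.log(1 - rho**2) - ss / (2 * sigma**2))

# Small simulated example (all numbers below are made up for illustration).
rng = np.random.default_rng(2)
T = 300
X = np.column_stack([np.ones(T), rng.normal(size=T)])
beta0, rho0, sigma0 = np.array([1.0, 2.0]), 0.6, 0.5
noise = np.zeros(T)
noise[0] = rng.normal(scale=sigma0 / np.sqrt(1 - rho0**2))
for t in range(1, T):
    noise[t] = rho0 * noise[t - 1] + sigma0 * rng.normal()
y = X @ beta0 + noise

ll_kf = kalman_loglik(beta0, rho0, sigma0, y, X)
ll_cf = closed_form_loglik(beta0, rho0, sigma0, y, X)
print(ll_kf, ll_cf)
```

A function like `kalman_loglik` could be handed to a general-purpose numerical optimizer to obtain the exact MLE; the filter form becomes more useful than the closed form once measurement noise or missing observations are added to the model.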
