由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
JobHunting版 - 请教大家一道ML设计题
相关主题
yelp skype面离成功转码还有多远? (转载)
What is the worst part about working at Google?[合集] 那个Google random generate 1-7的题怎么做啊?
报个offer,统计讨论个idea题
发一个MSFT bing的onsite面经问一道题
大家来讨论一下 software engineer-machine learning || data mining 的要求把。一道面试碰到的概率题
g家店面求教Careercup 150 上的一道题目
Machine learning / data science 面经以及一些总结MS on-site
东岸 data science /CS内推机会问个关于Epic的题目:Mingo
相关话题的讨论汇总
话题: upvotes话题: review话题: ml话题: let话题: yelp
进入JobHunting版参与讨论
1 (共1页)
x*****0
发帖数: 452
1
I've encountered this question online. How to design an algorithm to predict
how many upvotes a review would get in a certain time period after
publishing. Let's say the review comes from Yelp. So the problem becomes the
following:
> Give millions of reviews (text) and their associated upvotes in a
> certain time period, how do you design a ML predictive model?
Definitely, a lot of prior information may affect the upvotes of a certain
review would got. For example:
(1) Who wrote this review, elite or a regular user?
(2) The business of the review. A review for a hot restaurant may get
more upvotes than for a car mechanic. Generally speaking, when you have a
Yelp's review record, it contains a business id.
(3) ...
Let's pass these prior information and focus on the text features. Can
anyone give me some suggestions. I'm thinking using LDA (topic model) to
generate a topics dictionary but still did not come out a complete solution.
关于如何从text中generate features,大家有什么好的想法吗?
1 (共1页)
进入JobHunting版参与讨论
相关主题
问个关于Epic的题目:Mingo大家来讨论一下 software engineer-machine learning || data mining 的要求把。
吾波第三季亏损再破记录g家店面
[合集] 绿卡重要,还是career重要Machine learning / data science 面经以及一些总结
问一下关于CPT的事情东岸 data science /CS内推机会
yelp skype面离成功转码还有多远? (转载)
What is the worst part about working at Google?[合集] 那个Google random generate 1-7的题怎么做啊?
报个offer,统计讨论个idea题
发一个MSFT bing的onsite面经问一道题
相关话题的讨论汇总
话题: upvotes话题: review话题: ml话题: let话题: yelp