您将如何计算 IMDB 电影评级?
我这样做只是为了学习目的。我无意推翻 IMDB 的方法。
我问自己我拥有 IMDB 或类似的网站。我如何计算电影评级?
我能想到的只是加权平均值(只不过是算术平均值)
对于下面提供的电影数据计算将是
(38591*10 + 27994*9 + 32732*8 + 17864*7 + 7361*6 + 2965*5 + 1562*4 + 1073*3 + 891*2 + 3401*1) / 134434 = 8.17055953
我的评分 8.17055953 与 IMDB 评分(=加权平均值)不匹配。所以我的结论是我在这里遗漏了一些东西或者我的分数不是理想的分数。我可能会错过很多东西。
- 我的成绩有什么问题吗?为什么不理想?
- 如果你必须计算的话。你会怎么做呢?
I'm doing this only for learning purposes. I've no intentions of reversing the methods of IMDB.
I asked myself I owned IMDB or similar website. How would I compute the movie rating?
All I can think of is Weighted Average(which is nothing but Arithmetic Mean)
For a movie data provided below computation would be
(38591*10 + 27994*9 + 32732*8 + 17864*7 + 7361*6 + 2965*5 + 1562*4 + 1073*3 + 891*2 + 3401*1) / 134434 = 8.17055953
My rating 8.17055953 doesn't match with IMDBs rating (=weighted average). So my conclusion is I'm missing something here or my score is not an ideal score. I'm might be missing lot of things.
- Whats wrong with my score? Why is it not ideal?
- If you had to compute. How would you have done it?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
不完全符合主题,但对您有用:
如何计算排名。
我希望这有帮助。
PS 这已在此页面上得到确认:
IMDB Chart
Not exactly on topic, but useful for you:
How the ranking is calculated.
I hope that helps.
P.S. This is confirmed on this page:
IMDB Chart
首先:您的计算(这是un加权平均值,或简称平均值)有一个拼写错误:您有
19562
“4”评分,但您应该有1562
。修正后得到 ~8.17,四舍五入到与 IMDB 的算术平均值相同。接下来,如果您确实想要加权平均值,那么您需要一些东西来衡量!因此,如果用户拥有“声誉”或类似的声誉(即使只是注册的时间长度),您可以将其用作其评级权重的乘数。最后除以权重总和而不是总票数,就得到了加权平均值。
可能需要某种重量的比例因子 - 例如,如果 SO 做了这样的事情,那么乔恩·斯基特(Jon Skeet)的观点携带新重量的二十万倍可能有点不公平-注册用户的意见:)
First up: your calculation (which is an un weighted average, or simply the mean) has a typo: you have
19562
'4' ratings, but you should have1562
. Correcting this gives ~8.17, which rounds to the same as IMDB's Arithmetic mean'.Next, if you do want a weighted average, you need something to weight with! So if for example users have a 'reputation' or similar (even just a length of time registered), you could use that as a multiplier for the weight of their rating. At the end, divide by the sum of weights rather than the total number of votes, and you have a weighted average.
A scaling factor of some kind for the weight would probably be needed - eg if SO did something like this, it might be a little unfair for Jon Skeet's opinion to carry two hundred thousand times the weight of a newly-registered user's opinion :)