How to develop an English .com domain value rating algorithm?

Posted on 2024-10-09 20:48:14

I've been thinking about an algorithm that should roughly be able to guess the value of an English .com domain in most cases.

For this to work, I want to run tests that weigh the strengths and weaknesses of an English .com domain.

A simple point-based system is what I had in mind, where each domain property can be given a certain weight to factor its importance in.
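A minimal sketch of such a weighted point system in Python. The property names and the weights in the test values are placeholders chosen for illustration only; nothing here says what the right weights are.

```python
def total_score(scores: dict, weights: dict) -> float:
    """Combine per-property scores into one number.

    scores  maps a property name to the points it earned.
    weights maps a property name to its importance; a property
            without an explicit weight counts with weight 1.0.
    """
    return sum(weights.get(name, 1.0) * points for name, points in scores.items())
```

Each test described below would contribute one entry to scores, so the tests can be tuned or swapped out independently of each other.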

I had these properties in mind:

domain character length

E.g. initially 20 points are added. If the domain has 4 or fewer characters, no points are subtracted. For each extra character, one or more points are subtracted on an exponential basis (the more characters, the higher the penalty).
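The length rule could be sketched like this in Python. The growth base of 1.5 and the floor at zero points are assumptions made for the example; the rule above only fixes the 20 starting points and the 4 free characters.

```python
def length_score(name: str, start: float = 20.0,
                 free_chars: int = 4, base: float = 1.5) -> float:
    """Score the length of a domain label (the part before ".com")."""
    extra = max(0, len(name) - free_chars)
    # The first extra character costs 1 point, the next one `base` points,
    # then base**2, and so on, so the penalty grows exponentially.
    penalty = sum(base ** k for k in range(extra))
    # Assumption: the score never drops below zero.
    return max(0.0, start - penalty)
```

With these defaults a 4-letter name keeps all 20 points, a 5-letter name loses 1 point, and a 12-letter name such as "peanutgalaxy" is already down to zero, so the base probably needs tuning against real sales data.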

domain characters

E.g. initially 20 points are added. If the domain is only alphabetic, no points are subtracted. For each non-alphabetic character, X points are subtracted (exponential increase again).
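The same idea for the character test, as a Python sketch. X is left open above, so the first penalty of 2 points and the doubling for every further non-alphabetic character are assumed values, as is the floor at zero.

```python
def charset_score(name: str, start: float = 20.0,
                  first_penalty: float = 2.0, growth: float = 2.0) -> float:
    """Penalise every character that is not a plain letter a-z."""
    bad = sum(1 for ch in name if not ("a" <= ch.lower() <= "z"))
    # first_penalty for the first offender, then multiplied by `growth`
    # for each further one (assumed values, see the text above).
    penalty = sum(first_penalty * growth ** k for k in range(bad))
    return max(0.0, start - penalty)
```

Digits and hyphens are the only non-alphabetic characters a plain ASCII domain label can contain, so in practice this counts those two.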

domain name words

Scans through a big offline English database, including informal speech; e.g. words like "tweet" should be recognized.

Question 1: where can I get a modern list of English words for use in such an application? Are these lists available for free? Are there lists like these with informal words?

The more words are found per character, the more points are added. So, a domain with a lot of characters will still not get a lot of points.
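Finding the words inside a name without separators is a word segmentation problem. A sketch using dynamic programming, assuming the word list is already loaded into a set; the tiny WORDS set below is only a stand-in for a real list, and preferring the split with the fewest words is a choice made for this example.

```python
from typing import List, Optional, Set

# Stand-in for a real word list loaded from a file.
WORDS = {"shop", "deals", "deal", "peanut", "galaxy", "free", "money", "tweet"}

def segment(name: str, words: Set[str]) -> Optional[List[str]]:
    """Split name into the fewest dictionary words, or None if no full split exists."""
    # best[i] holds the best split of name[:i], or None if there is none.
    best: List[Optional[List[str]]] = [None] * (len(name) + 1)
    best[0] = []
    for end in range(1, len(name) + 1):
        for start in range(end):
            prefix = best[start]
            piece = name[start:end]
            if prefix is not None and piece in words:
                candidate = prefix + [piece]
                if best[end] is None or len(candidate) < len(best[end]):
                    best[end] = candidate
    return best[len(name)]

def word_density(name: str, words: Set[str]) -> float:
    """Words found per character; 0.0 if the name cannot be fully split."""
    parts = segment(name, words)
    return 0.0 if not parts else len(parts) / len(name)
```

For example, segment("shopdeals", WORDS) gives ["shop", "deals"], while "freemoneyz" has no full split and returns None.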

words hype-level

I believe this is a tricky one, but this is what should differentiate perfect but boring domains from perfect and interesting ones.

For example, the following domain is probably not that valuable: www.peanutgalaxy.com

The algorithm should identify that peanuts and galaxies are not very popular topics on the web. This is just an example.

On the other hand, a domain like www.shopdeals.com should score well on the hype test, as shops and deals are quite popular on the web.

My initial thought would be to see how often these keywords are referenced on the web, preferably with some database.
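Assuming an offline table of word counts is available, the lookup could be sketched as below. The counts in FREQ are made up purely so the example runs and are not real web statistics; averaging the log of the counts is also just one possible way to turn them into points.

```python
import math

# Invented counts for illustration only; a real table would be built
# from a corpus or a published word-frequency list.
FREQ = {"shop": 1_000_000, "deals": 500_000, "peanut": 20_000, "galaxy": 50_000}

def hype_score(words, freq, max_points: float = 20.0) -> float:
    """Average log-frequency of the words, scaled to the range 0..max_points."""
    if not words or not freq:
        return 0.0
    top = math.log10(max(freq.values()))
    if top <= 0:
        return 0.0
    # Unknown words count as frequency 1, i.e. they contribute nothing.
    avg = sum(math.log10(freq.get(w, 1)) for w in words) / len(words)
    return max_points * avg / top
```

With these invented numbers, ["shop", "deals"] scores higher than ["peanut", "galaxy"], which is the ordering the hype test is meant to produce.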

Question 2: is this logic flawed, or does this hype level test have merit?

Question 3: are such "hype databases" available? Or is there anything else that could work offline? The problem with e.g. querying Google is that it requires a lot of requests, due to the many domains to be tested.

domain name spelling mistakes

Domains like "freemoneyz.com" etc. are generally (notice I am making a lot of assumptions in this post, but that's necessary I believe) not valuable due to the spelling mistakes.

Question 4: are there any offline APIs available to check for spelling mistakes, preferably in JavaScript or some database that I can interact with myself? Or would a word list help here as well?
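A word list alone can already flag near misses. A sketch using Python's standard difflib module; the 0.8 similarity cutoff and the small word set are assumptions made for the example.

```python
import difflib
from typing import Optional, Set

def closest_word(token: str, words: Set[str], cutoff: float = 0.8) -> Optional[str]:
    """Return the dictionary word that token is probably a misspelling of.

    Returns None if token is itself a known word or if nothing is similar enough.
    """
    if token in words:
        return None
    # sorted() only makes the result deterministic when several words tie.
    matches = difflib.get_close_matches(token, sorted(words), n=1, cutoff=cutoff)
    return matches[0] if matches else None
```

For "freemoneyz", the leftover piece "moneyz" is matched to "money", so the name can be treated as a misspelling rather than as a clean two-word domain.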

use of consonants, vowels etc.

A domain that is easy to pronounce (e.g. Google) is usually much more valuable than one that is not (e.g. Gkyld).

Question 5: how does one test for such pronounceability? Do you check for consonants, vowels, etc.? What does a valuable domain have? Has there been any work in this field, and where should I look?
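One crude heuristic is to look at the longest run of consonants and the share of vowels. The sketch below assumes thresholds of at most 3 consonants in a row and at least 25% vowels, and treats "y" as a consonant; all three are simplifications, not established rules.

```python
VOWELS = set("aeiou")  # simplification: "y" is treated as a consonant

def longest_consonant_run(name: str) -> int:
    """Length of the longest unbroken run of consonant letters."""
    longest = run = 0
    for ch in name.lower():
        if ch.isalpha() and ch not in VOWELS:
            run += 1
            longest = max(longest, run)
        else:
            run = 0
    return longest

def looks_pronounceable(name: str, max_run: int = 3,
                        min_vowel_ratio: float = 0.25) -> bool:
    """Heuristic check based on consonant clusters and vowel share."""
    letters = [ch for ch in name.lower() if ch.isalpha()]
    if not letters:
        return False
    vowel_ratio = sum(ch in VOWELS for ch in letters) / len(letters)
    return longest_consonant_run(name) <= max_run and vowel_ratio >= min_vowel_ratio
```

This accepts "Google" and rejects "Gkyld", but it also rejects real English words with long clusters such as "strengths", so it is better used as one weighted signal than as a hard filter.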

That is what I came up with, which leads me to my final two questions.

Question 6: can you think of any more English .com domain strengths or weaknesses? Which? How would you implement these?

Question 7: do you believe this idea has any merit at all, or am I too naive? Anything I should know, read or hear about? Suggestions/comments?

Thanks!

Comments (2)

娇纵 2024-10-16 20:48:14

The value of a domain is the highest price someone is prepared to pay on the day of sale. It really is pretty arbitrary, especially when an unregistered domain can be bought for less than $20.

岁吢 2024-10-16 20:48:14

The domain evaluator tool is what you're looking for.
Here's a screenshot.

Optionally, you can make a simple bidding website. If someone clicks to show interest in a certain domain, that raises the hidden price. Only when a user does XYZ can they see the price; by then the community has effectively done your valuation for you.
