I'm not sure this is the best place to ask your question. Maybe your local librarian could help. Anyway, I'm answering because I've done a lot of work along these lines in the past, and because I've found that programmers and database designers often know where to find data--especially authoritative and standard data.
At the local level (in the USA), we accepted whatever the local Chamber of Commerce gave us. At the national level, we bought lists from InfoUSA. Chamber of Commerce data can be pretty flaky; InfoUSA data is very clean.
Dun & Bradstreet is the closest I know of to a one-stop global business registry. They're not cheap.
RBA, a company in the UK, seems to have a really useful introduction with a global perspective. See Official Company Registers. Much of the data there is free.
I have been doing some research in this area and found a recent paper which discusses an approach to extract, discover (via clustering) and normalize (by an enhanced edit-distance calculation) organization names. NEMO
发布评论
评论(2)
我不确定这是提问的最佳地点。也许您当地的图书馆员可以提供帮助。不管怎样,我回答这个问题是因为我过去在这些方面做了很多工作,而且因为我发现程序员和数据库设计者通常知道在哪里可以找到数据——尤其是权威和标准的数据。
在当地(美国),我们接受当地商会给我们的任何东西。在国家层面,我们从 InfoUSA 购买了列表。商会的数据可能非常不稳定; InfoUSA 的数据非常干净。
邓& Bradstreet 是我所知道的最接近一站式全球企业注册处的机构。它们并不便宜。
RBA,一家英国公司,似乎有一个非常有用的介绍,全球视野。请参阅官方公司登记册。那里的大部分数据都是免费的。
I'm not sure this is the best place to ask your question. Maybe your local librarian could help. Anyway, I'm answering because I've done a lot of work along these lines in the past, and because I've found that programmers and database designers often know where to find data--especially authoritative and standard data.
At the local level (in the USA), we accepted whatever the local Chamber of Commerce gave us. At the national level, we bought lists from InfoUSA. Chamber of Commerce data can be pretty flaky; InfoUSA data is very clean.
Dun & Bradstreet is the closest I know of to a one-stop global business registry. They're not cheap.
RBA, a company in the UK, seems to have a really useful introduction with a global perspective. See Official Company Registers. Much of the data there is free.
我一直在这个领域进行一些研究,并发现了最近的一篇论文,其中讨论了一种提取、发现(通过聚类)和规范化(通过增强的编辑距离计算)组织名称的方法。 NEMO
I have been doing some research in this area and found a recent paper which discusses an approach to extract, discover (via clustering) and normalize (by an enhanced edit-distance calculation) organization names. NEMO