需要 RDF 样本数据集
大家好 我一直在寻找足够大的语义数据集来对我正在开发的算法进行一些测试。
我的意思是一个已经存在的 RDF/XML 文件,我可以“轻松”上传到 AllegroGraph。我找到了几个虚拟数据集,但它们使用了不切实际的数据,如“char1”、“char2”、“node121”等。一开始没问题。
但现在我需要使用有关真实事物的数据集进行测试,无论是汽车、植物、电影、书籍等。几种的组合将是理想的。具体来说,是一个包含超过 50k 个对象且至少有 3 或 4 个面的对象。 有人告诉我这些数据集就在那里,但我找不到它们。
任何链接、指示或建议都非常受欢迎。另外,如果有更好的网站来发布这个问题,我会遵循建议。
Hi everyone
Ive been looking for some time now for a big enough semantic dataset to do some testing on an algorithm Im developing.
With this I mean an already existing RDF/XML file that I could "easily" upload to AllegroGraph. I have found several dummy datasets but they use unrealistic data, as in "char1", "char2", "node121", etc. Which is ok at first.
But now I need to test using a dataset about real stuff, be it cars, plants, movies, books, etc. A combination of several would be ideal. Specifically one with over 50k object with at least 3 or 4 facets.
I have been told these datasets are somewhere out there but I cant find them.
Any links, pointers or suggestions are more tha welcome. Also if there is a better site to post this question i will follow the advice.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
经过更多时间的研究后,我遇到了一个非常好的选择。
那是 DBpedia.org
他们从维基百科收集所有数据并将其划分为特定部分。
出于测试目的,我很可能会使用
我想我的问题是我仍在熟悉这些概念以及如何在语义网络方面搜索我想要的内容。希望这个链接能帮助更多的人:)
after looking more time I ran into a very good option.
That is DBpedia.org
They collect all the data from Wikipedia and divide it into specific parts.
For my testing purposes I will most probably be using
I guess my problem was that I am still getting familiarized with the concepts and how to search for what i want when it comes to semantic web. Hopefully this link will help more people :)