如何检索主题的类型

发布于 2024-09-07 14:40:55 字数 910 浏览 4 评论 0原文

据我了解，Freebase 分类法通常可以归结为以下层次结构：

Domain Category > Domain > Type > Topic

我有一个应用程序，它接收输入并进行一些自然语言处理，输出一堆术语 - 有些有用，有些则无用。在系统地“决定”一个术语是否有用的初步努力中，我的想法是通过假设它是一个主题并查看 Freebase 是否将该术语分类为至少一个<强>类型。

所以我现在想做的是，给定一个主题，找到它的类型 ID（最好是名称）。如果没有返回，那就告诉我一些关于所谓主题的信息。如果返回一种或多种类型，那么我不仅可以衡量该术语的有用性，而且还能够覆盖 Freebase 分类法并为人们提供一种不同的访问方法（通过该树比喻）。

例如，我可能会从 NLP 引擎收到“政治”、“政治组织”、“行政”、“照片”、“MSN”等。哪种 MQL 查询可以告诉我哪些类型与这些主题相关（如果有）？

感谢您的帮助。

更新

我刚刚经历了一次重大的拍头时刻。我离开了我已经摆弄了一段时间的查询，当我回来时，我看到了我的方式的错误。我试图让这种方式变得太困难，并且一如既往，我看不到的简单解决方案正是我需要看到的：

[{
  "id": null,
  "name": "Politics",
  "type": [{"id": null, "name": null }]
}]

不过，这给我带来了一个稍微不同的问题。我返回的是多个主题，其中一个是en/politics，还有一堆id是/m/...等。我知道Freebase系统很复杂，但是我距离理解这种复杂性还有很长的路要走。对于这种练习，我最有可能想要 /en/ 主题吗？

原文

As I understand it, the Freebase taxonomy generally boils down to this hierarchy:

Domain Category > Domain > Type > Topic

I have an application that receives input and does a bit of natural language processing that spits out a bunch of terms--some useful and some not. In an initial effort to systematically "decide" whether a term is useful, my thought is to "test" it against Freebase by assuming it's a topic and seeing whether Freebase has the term classified under at least one type.

So what I'm trying to do now is, given a topic, find its type IDs (and names, ideally). If none are returned, that tells me something about the so-called topic. If one or more types is returned, then I not only have some measure of the term's usefulness, but also an ability to overlay the Freebase taxonomy and give folks a different method of accessing it (via that tree metaphor).

For example, I might receive "Politics", "Political organization", "administration", "photo", "MSN", etc. from the NLP engine. What kind of MQL query can tell me which type(s) are connected to those topics, if any?

Thanks for your help.

UPDATE

I just had one of those grandiose head slap moments. I stepped away from the query I'd been tinkering with for a while and when I got back, I saw the error of my ways. I was trying to make this way too difficult and, as always, the simple solution that I couldn't see was exactly what I needed to see:

[{
  "id": null,
  "name": "Politics",
  "type": [{"id": null, "name": null }]
}]

This leads me to a slightly different question, though. What I get back is multiple topics, one of which is en/politics and a bunch of others whose id is /m/..., etc. I understand that the Freebase system is complex, but I'm a long way from understanding that complexity. For this kind of exercise, am I mostly likely to want the /en/ topic?

分享到QQ

分享到微博