NLP算法“填写”搜索词

发布于 2024-12-07 20:46:19 字数 535 浏览 1 评论 0原文

我正在尝试编写一种算法（我假设该算法将依赖于自然语言处理技术）来“填写”搜索词列表。这种东西可能有一个我不知道的名字。这种问题叫什么，什么样的算法会给我以下行为？

输入：

    docs = [
    "I bought a ticket to the Dolphin Watching cruise",
    "I enjoyed the Dolphin Watching tour",
    "The Miami Dolphins lost again!",
    "It was good going to that Miami Dolphins game"
    ], 
    search_term = "Dolphin"

输出：

["Dolphin Watching", "Miami Dolphins"]

基本上应该弄清楚，如果“Dolphin”出现，它实际上总是在二元组“Dolphin Watching”或“Miami Dolphins”中。首选 Python 解决方案。

原文

I'm trying to write an algorithm (which I'm assuming will rely on natural language processing techniques) to 'fill out' a list of search terms. There is probably a name for this kind of thing which I'm unaware of. What is this kind of problem called, and what kind of algorithm will give me the following behavior?

Input:

    docs = [
    "I bought a ticket to the Dolphin Watching cruise",
    "I enjoyed the Dolphin Watching tour",
    "The Miami Dolphins lost again!",
    "It was good going to that Miami Dolphins game"
    ], 
    search_term = "Dolphin"

Output:

["Dolphin Watching", "Miami Dolphins"]

It should basically figure out that if "Dolphin" appears at all, it's virtually always either in the bigrams "Dolphin Watching" or "Miami Dolphins". Solutions in Python preferred.

分享到QQ

分享到微博