如何最好地存储聊天机器人的数据？

发布于 2024-12-05 01:53:51 字数 1223 浏览 1 评论 0原文

我在互联网上寻找聊天机器人。这只是有趣。但现在，我非常喜欢这个主题，所以我想开发自己的聊天机器人。
但第一件事是寻找一种好方法来管理我的聊天机器人的“大脑”。我认为将所有内容保存在 XML 文件中是最好的解决方案，不是吗？
这样文件类型就清楚了。涉及不同名词之间的关系等。当我有一个名词时，例如一棵树。如何才能最好地保存一棵树的叶子、树枝和根。一棵树需要水和阳光才能生存？
我应该这样保存还是以其他方式保存？

这将是我的此树的 XML - 示例：

<nouns>
    <noun id="noun_0">
        <name>tree</name>
        <relationship>
            <has>noun_1</has>
            <has>noun_2</has>
            <has>noun_3</has>
            <need>noun_4</need>
            <need>noun_5</need>
        </relationship>
    </noun>
    <noun id="noun_1">
        <name>root</name>
    </noun>
    <noun id="noun_2">
        <name>branch</name>
        <relationship>
            <has>noun_3</has>
        </relationship>
    </noun>
    <noun id="noun_3">
        <name>leaf</name>
    </noun>
    <noun id="noun_4">
        <name>water</name>
    </noun>
    <noun id="noun_5">
        <name>light</name>
    </noun>

    . . .

</nouns>

原文

I was looking on the internet for chatbots. It was only fun. But now, I love this subject so much that I want to develop my own chatbot.
But the first thing is to look for a good way to manage the "brain" of my chatbot. I think that it's the best solution to save everything in a XML file, isn't it?
So the file type is clear. Comes to the relationship between different nouns etc. When I have a noun, e.g. a tree. How do I save best that a tree has leaves, branches and roots. And that a tree needs water and sunlight to survive?
Should I save it like that or otherwise?

This would be my XML for this tree-example:

<nouns>
    <noun id="noun_0">
        <name>tree</name>
        <relationship>
            <has>noun_1</has>
            <has>noun_2</has>
            <has>noun_3</has>
            <need>noun_4</need>
            <need>noun_5</need>
        </relationship>
    </noun>
    <noun id="noun_1">
        <name>root</name>
    </noun>
    <noun id="noun_2">
        <name>branch</name>
        <relationship>
            <has>noun_3</has>
        </relationship>
    </noun>
    <noun id="noun_3">
        <name>leaf</name>
    </noun>
    <noun id="noun_4">
        <name>water</name>
    </noun>
    <noun id="noun_5">
        <name>light</name>
    </noun>

    . . .

</nouns>

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

云巢 2024-12-12 01:53:51

数据存储选择：视情况而定

简单、非学习型机器人：XML 就可以

看起来您已经制定了基本的 XML 结构。对于刚开始的人来说，我认为这很好，特别是对于人工智能支持聊天类型的机器人（if userMsg.contains('lega') then print('TOS & Copyright...' ）

当然，切换到任何新格式都需要时间和开销

学习，复杂的机器人：数据库！

如果您想做更大的事情，特别是如果您有CleverBot 记住，我认为您将需要一个数据库，这是因为当您的文件 .. 是一个文件并且是时。对于这种项目，我建议使用一个数据库。

为什么英语很复杂

不久前我写了一个 nieve bayes 垃圾邮件分类器。大约花了10,000 条垃圾邮件以 7% 的准确率“训练”它，需要大约 6 个小时和 1.5GB RAM 来将数据保存在内存中，这是非常困难的，并且无法真正被破解。变成if 'pony' then 'saddle'，因此对于机器人来说，要“学习”最佳响应，您的数据库将变得庞大且非常快。