当前位置：文江博客话题详情

从较长的字符串创建人类可读的短字符串

发布于 2024-09-30 10:37:48 字数 395 浏览 10 评论 0原文

我需要收缩一个字符串，例如......

你会考虑成为一个机器人吗？您将获得每年一次免费换油的机会。”

...更短但仍然人类可识别（需要从选择列表中找到 -我当前的解决方案让用户输入任意标题，其唯一目的是选择）

我想仅提取形成问题的字符串部分（如果可能），然后以某种方式将其减少为类似的内容

会考虑成为机器人

有没有任何语法算法可以帮助我解决这个问题？我认为可能有一些东西可以让 be 只挑选出动词和名词。

由于这只是充当钥匙，因此不必是完美的；我并不是想淡化英语固有的复杂性。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

月下客 2024-10-07 10:37:48

可能太简单了，但我可能会想从“填充词”列表开始：

var fillers = new[]{"you","I","am","the","a","are"};

然后提取问号之前的所有内容（使用正则表达式、字符串混合，无论你喜欢什么），产生“你会考虑成为一个机器人吗”。

然后遍历字符串，提取每个被认为是填充物的单词。

var sentence = "Would you consider becoming a robot";
var newSentence = String.Join("",sentence.Split(" ").Where(w => !fillers.Contains(w)).ToArray());
// newSentence is "Wouldconsiderbecomingrobot".

帕斯卡大小写每个单词将产生您想要的字符串 - 我将把它作为读者的练习。

Probably too simplistic, but I might be tempted to start with a list of "filler words":

var fillers = new[]{"you","I","am","the","a","are"};

Then extract everything before a questionmark (using regex, string mashing, whatever you fancy), yielding you "Would you consider becoming a robot".

Then go through the string extracting every word considered a filler.

var sentence = "Would you consider becoming a robot";
var newSentence = String.Join("",sentence.Split(" ").Where(w => !fillers.Contains(w)).ToArray());
// newSentence is "Wouldconsiderbecomingrobot".

Pascal casing each word would result in your desired string - i'll leave that as an excercise for the reader.

回复收藏 0 原文

少女的英雄梦 2024-10-07 10:37:48

创建一个流行的社交媒体网站。当用户想要加入或发表评论时，提示他们解决验证码。验证码将包括将长字符串的缩短版本与其完整版本进行匹配。您的缩短算法将基于根据验证码结果进行训练的神经网络或遗传算法。

您还可以在网站上出售广告。

回复收藏 0 原文

梦萦几度 2024-10-07 10:37:48

我最终创建了以下扩展方法，它的工作效果出奇的好。感谢 Joe Blow 出色而有效的建议：

    public static string Contract(this string e, int maxLength)
    {
        if(e == null) return e;

        int questionMarkIndex = e.IndexOf('?');
        if (questionMarkIndex == -1)
            questionMarkIndex = e.Length - 1;

        int lastPeriodIndex = e.LastIndexOf('.', questionMarkIndex, 0);

        string question = e.Substring(lastPeriodIndex != -1 ? lastPeriodIndex : 0, questionMarkIndex + 1).Trim();

        var punctuation =
            new [] {",", ".", "!", ";", ":", "/", "...", "...,", "-,", "(", ")", "{", "}", "[", "]","'","\""};

        question = punctuation.Aggregate(question, (current, t) => current.Replace(t, ""));

        IDictionary<string, bool> words = question.Split(' ').ToDictionary(x => x, x => false);

        string mash = string.Empty;
        while (words.Any(x => !x.Value) && mash.Length < maxLength)
        {
            int maxWordLength = words.Where(x => !x.Value).Max(x => x.Key.Length);
            var pair = words.Where(x => !x.Value).Last(x => x.Key.Length == maxWordLength);
            words.Remove(pair);
            words.Add(new KeyValuePair<string, bool>(pair.Key, true));
            mash = string.Join("", words.Where(x => x.Value)
                                       .Select(x => x.Key.Capitalize())
                                       .ToArray()
                );
        }

        return mash;
    }

这将以下内容缩减为 15 个字符：

这没有任何先决条件 - 写一篇文章...：PrereqsWriteEssay
You'已选择一辆车：YouveSelectedCar

I ended up creating the following extension method which does work surprisingly well. Thanks to Joe Blow for his excellent and effective suggestions:

    public static string Contract(this string e, int maxLength)
    {
        if(e == null) return e;

        int questionMarkIndex = e.IndexOf('?');
        if (questionMarkIndex == -1)
            questionMarkIndex = e.Length - 1;

        int lastPeriodIndex = e.LastIndexOf('.', questionMarkIndex, 0);

        string question = e.Substring(lastPeriodIndex != -1 ? lastPeriodIndex : 0, questionMarkIndex + 1).Trim();

        var punctuation =
            new [] {",", ".", "!", ";", ":", "/", "...", "...,", "-,", "(", ")", "{", "}", "[", "]","'","\""};

        question = punctuation.Aggregate(question, (current, t) => current.Replace(t, ""));

        IDictionary<string, bool> words = question.Split(' ').ToDictionary(x => x, x => false);

        string mash = string.Empty;
        while (words.Any(x => !x.Value) && mash.Length < maxLength)
        {
            int maxWordLength = words.Where(x => !x.Value).Max(x => x.Key.Length);
            var pair = words.Where(x => !x.Value).Last(x => x.Key.Length == maxWordLength);
            words.Remove(pair);
            words.Add(new KeyValuePair<string, bool>(pair.Key, true));
            mash = string.Join("", words.Where(x => x.Value)
                                       .Select(x => x.Key.Capitalize())
                                       .ToArray()
                );
        }

        return mash;
    }

This contracts the following to 15 chars: