删除重复项 - ** 仅当重复项按顺序出现时

发布于 2024-10-23 16:26:12 字数 217 浏览 5 评论 0原文

我想做类似于以下的事情，但我只想删除“g”和“g”，因为它们是相继出现的重复项。我也想保持顺序相同。

任何帮助将不胜感激！

我在 MATLAB 中有这个元胞数组：

y = { 'd' 'f' 'a' 'g' 'g' 'w' 'a' 'h'}


ans =

'd'    'f'    'a'    'w'    'a'    'h'

原文

I would like to do something similar to the following, except I would only like to remove 'g' and'g' because they are the duplicates that occur one after each other. I would also like to keep the sequence the same.

Any help would be appreciated!!!

I have this cell array in MATLAB:

y = { 'd' 'f' 'a' 'g' 'g' 'w' 'a' 'h'}


ans =

'd'    'f'    'a'    'w'    'a'    'h'

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

小傻瓜 2024-10-30 16:26:12

当我的第一个答案（如下）用于多个重复项时，出现错误（感谢 grantnz）。这是更新版本：

>> y = { 'd' 'f' 'a' 'g' 'g' 'w' 'a' 'h' 'h' 'i' 'i' 'j'};
>> i = find(diff(char(y)) == 0);
>> y([i; i+1]) = []

y = 

    'd'    'f'    'a'    'w'    'a'    'j'

旧答案

如果您的“单元向量”始终仅包含单个字符元素，您可以执行以下操作：

>> y = { 'd' 'f' 'a' 'g' 'g' 'w' 'a' 'h'}

y = 

    'd'    'f'    'a'    'g'    'g'    'w'    'a'    'h'

>> y(find(diff(char(y)) == 0) + [0 1]) = []

y = 

    'd'    'f'    'a'    'w'    'a'    'h'

There was an error in my first answer (below) when used on multiple duplicates (thanks grantnz). Here's an updated version:

>> y = { 'd' 'f' 'a' 'g' 'g' 'w' 'a' 'h' 'h' 'i' 'i' 'j'};
>> i = find(diff(char(y)) == 0);
>> y([i; i+1]) = []

y = 

    'd'    'f'    'a'    'w'    'a'    'j'

OLD ANSWER

If your "cell vector" always contains only single character elements you can do the following:

>> y = { 'd' 'f' 'a' 'g' 'g' 'w' 'a' 'h'}

y = 

    'd'    'f'    'a'    'g'    'g'    'w'    'a'    'h'

>> y(find(diff(char(y)) == 0) + [0 1]) = []

y = 

    'd'    'f'    'a'    'w'    'a'    'h'

回复收藏 0 原文

时光倒影 2024-10-30 16:26:12

像这样看：当且仅当（1）它是第一个元素或（2）它的前一个元素与它不同并且或者（3）它是最后一个元素时，您想要保留一个元素(4) 其后继者与其不同。所以：

y([true ~strcmp(y(1:(end-1)),y(2:end))] & [~strcmp(y(1:(end-1)),y(2:end)) true])

或者，也许更好，

different = ~strcmp(y(1:(end-1)),y(2:end));
result = y([true different] & [different true]);

Look at it like this: you want to keep an element if and only if either (1) it's the first element or (2) its predecessor is different from it and either (3) it's the last element or (4) its successor is different from it. So:

y([true ~strcmp(y(1:(end-1)),y(2:end))] & [~strcmp(y(1:(end-1)),y(2:end)) true])

or, perhaps better,

different = ~strcmp(y(1:(end-1)),y(2:end));
result = y([true different] & [different true]);

回复收藏 0 原文

爱的十字路口 2024-10-30 16:26:12

这应该有效：

 y([ diff([y{:}]) ~= 0 true])

或者稍微更紧凑

 y(diff([y{:}]) == 0) = []

更正：上面不会删除两个重复项

ind = diff([y{:}]) == 0;
y([ind 0] | [0 ind]) = []

顺便说一句，即使有多个重复序列，这也有效，

例如，

y = { 'd' 'f' 'a' 'g' 'g' 'w' 'a' 'h' 'h'};
ind = diff([y{:}]) == 0;

y([ind 0] | [0 ind]) = []

y = 

     'd'    'f'    'a'    'w'    'a'

This should work:

 y([ diff([y{:}]) ~= 0 true])

or slightly more compactly

 y(diff([y{:}]) == 0) = []

Correction : The above wont remove both the duplicates

ind = diff([y{:}]) == 0;
y([ind 0] | [0 ind]) = []

BTW, this works even if there are multiple duplicate sequences

eg,

y = { 'd' 'f' 'a' 'g' 'g' 'w' 'a' 'h' 'h'};
ind = diff([y{:}]) == 0;

y([ind 0] | [0 ind]) = []

y = 

     'd'    'f'    'a'    'w'    'a'

回复收藏 0 原文

~没有更多了~

关于作者

欲拥i

暂无简介

0 文章

0 评论

23 人气

关注发私信

友情链接

文江博客

删除重复项 - ** 仅当重复项按顺序出现时

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（3）

关于作者

相关话题

热门标签

推荐作者

巷子口的你

微信用户

神妖

鞋纸虽美，但不合脚ㄋ〞

7460852697

ligengkai

友情链接

删除重复项 - ** 仅当重复项按顺序出现时

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（3）

关于作者

相关话题

热门标签

推荐作者

巷子口的你

微信用户

神妖

鞋纸虽美，但不合脚ㄋ〞

7460852697

ligengkai

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。