Simplify this regular expression

Posted on 2024-10-16 10:38:21

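I'm doing some pre-exam exercises for my compilers class and need to simplify this regular expression.

(a U b)*(a U e)b* U (a U b)*(b U e)a*

Here e is the empty string and U stands for union.

So far, I think one of the (a U b)* can be removed, since a U a = a. However, I can't find any other simplifications, and I'm not doing so well with the other problems so far. :(

Any help is appreciated, thanks very much!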



放赐 2024-10-23 10:38:21

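First, translate the expression into an English description of the language:

(a U b)*(a U e)b* U (a U b)*(b U e)a*

Translates to:

Any sequence of a's or b's, followed by an optional a, followed by any number of b's.

OR

Any sequence of a's and b's, followed by an optional b, followed by any number of a's.

There is a lot of overlap here. At the very least, (a U b)*(a U e) is exactly the same as (a U b)*: the optional a can always be empty, and when it is present, the resulting string is still just a sequence of a's and b's. The same holds for (a U b)*(b U e), so those groups can be eliminated, leaving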

(a U b)*b* U (a U b)*a*
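Written out algebraically, the absorption step looks roughly like this. This is only a sketch, using ε for the empty string (e in the question) and ∪ for U:

```latex
\begin{align*}
(a \cup b)^*(a \cup \varepsilon)
  &= (a \cup b)^* a \;\cup\; (a \cup b)^* \\
  &= (a \cup b)^* && \text{since } (a \cup b)^* a \subseteq (a \cup b)^*
\end{align*}
```

The same argument with b in place of a handles (a U b)*(b U e).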

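Translates to:

Any sequence of a's or b's, followed by any number of b's.

OR

Any sequence of a's and b's, followed by any number of a's.

Now the first part of those two outermost groups is the same, so let's collapse them into one: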

(a U b)*(a* U b*)
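The collapse is just concatenation distributing over union, RS ∪ RT = R(S ∪ T), with R = (a U b)*. As a one-line sketch in the same notation as before:

```latex
\[
(a \cup b)^* b^* \;\cup\; (a \cup b)^* a^*
  \;=\; (a \cup b)^* (b^* \cup a^*)
  \;=\; (a \cup b)^* (a^* \cup b^*)
\]
```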

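Translates to:

Any sequence of a's or b's, followed by any number of a's OR by any number of b's.

Now hold on a minute: any sequence of a's and b's already ends with "any number of a's or any number of b's" (possibly zero of them), so anything that matches the first part matches the whole regex, because the second part can have length zero. Conversely, appending a run of a's or a run of b's to a sequence of a's and b's just gives another such sequence. So why don't we just make it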

(a U b)*
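One way to make this last step precise is to sandwich the expression between two copies of (a U b)*. The first inclusion holds because a* ∪ b* contains the empty string, the second because a* ∪ b* is contained in (a ∪ b)*:

```latex
\[
(a \cup b)^*
  \;\subseteq\; (a \cup b)^* (a^* \cup b^*)
  \;\subseteq\; (a \cup b)^* (a \cup b)^*
  \;=\; (a \cup b)^*
\]
```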

Ta da. Simple.
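If you want a sanity check on the steps above, a brute-force comparison is easy to write. The sketch below assumes Python's `re` module, with U written as `|` and (a U e) written as `a?`. The names `STAGES`, `language`, and `stages_agree` are made up for this illustration. It only tests strings up to a bounded length, so it is evidence that the steps are right, not a proof.

```python
import re
from itertools import product

# Textbook notation rewritten in Python `re` syntax:
# U becomes |, and (a U e) becomes a? (an optional a).
STAGES = {
    "original": r"(a|b)*a?b*|(a|b)*b?a*",
    "optional symbols absorbed": r"(a|b)*b*|(a|b)*a*",
    "common prefix factored": r"(a|b)*(a*|b*)",
    "fully simplified": r"(a|b)*",
}

def language(pattern, max_len):
    """Return the strings over {a, b} of length <= max_len that the pattern matches in full."""
    compiled = re.compile(pattern)
    return {
        "".join(chars)
        for n in range(max_len + 1)
        for chars in product("ab", repeat=n)
        if compiled.fullmatch("".join(chars))
    }

def stages_agree(max_len=8):
    """True if every stage accepts exactly the same strings up to max_len."""
    languages = [language(p, max_len) for p in STAGES.values()]
    return all(lang == languages[0] for lang in languages)

print(stages_agree())  # prints True
```

If one of the intermediate steps were wrong, the corresponding stage would accept a different set of strings and the check would print False.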
