
Best way to parse a list of numbers

Posted 2024-12-04 04:13:32

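I have a problem where I need to process a list of numbers that appears in an English sentence. It can take any of the following formats:

items 1, 2 and 3

items 2 through 5

items 1 to 20

items 4 or 8

My first instinct was to write a simple state machine to parse it, but I wonder whether there is a better (simpler) way, perhaps some regular expression. Any suggestions?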


回忆追雨的时光 2024-12-11 04:13:32

If you have C++11, the following parser (AXE) will parse all your formats (I didn't test it):

unsigned i;
auto num = axe::r_unsigned(i);
auto space = axe::r_any(" \t");
// "1, 2 and 3": comma-separated numbers, then an optional " and N"
// (whitespace is required before "and" as well as after it)
auto format1 = num % (*space & ',' & *space) & ~(+space & "and" & +space & num);
// "2 through 5"
auto format2 = num & +space & "through" & +space & num;
// "1 to 20"
auto format3 = num & +space & "to" & +space & num;
// "4 or 8"
auto format4 = num & +space & "or" & +space & num;
auto format = "items" & +space & (format1 | format2 | format3 | format4);

If you don't have C++11, you can write a similar parser in C++ using boost::spirit. Such a parser is easier and shorter to write and debug than regular expressions, and it also gives you a lot of flexibility in creating parsing rules and semantic actions.
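For comparison with the regular-expression route the question asks about, here is a minimal sketch that uses only std::regex from the C++11 standard library. It is not part of the AXE answer above. The function name parse_items is hypothetical, and the sketch assumes that "A through B" and "A to B" both mean the inclusive range A..B, which the question does not state. It is also slightly more permissive than the grammar above, since it accepts "or" after a comma list.

```cpp
#include <regex>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper (not from the original post): parses one phrase such
// as "items 1, 2 and 3" and returns the numbers it refers to.
// Assumption: "A through B" and "A to B" denote the inclusive range A..B.
std::vector<unsigned long> parse_items(const std::string& text)
{
    // "items A through B" or "items A to B"
    static const std::regex range_re(
        R"(\s*items\s+(\d+)\s+(?:through|to)\s+(\d+)\s*)");
    // "items A", "items A, B, C", optionally ending in "and N" or "or N"
    static const std::regex list_re(
        R"(\s*items\s+\d+(?:\s*,\s*\d+)*(?:\s+(?:and|or)\s+\d+)?\s*)");
    static const std::regex number_re(R"(\d+)");

    std::vector<unsigned long> result;
    std::smatch m;

    if (std::regex_match(text, m, range_re)) {
        const unsigned long first = std::stoul(m[1].str());
        const unsigned long last = std::stoul(m[2].str());
        if (first > last)
            throw std::invalid_argument("descending range: " + text);
        // Expand the range into its individual numbers.
        for (unsigned long n = first; n < last; ++n)
            result.push_back(n);
        result.push_back(last);
        return result;
    }

    if (std::regex_match(text, list_re)) {
        // The overall shape is already validated, so collect every number.
        for (std::sregex_iterator it(text.begin(), text.end(), number_re), end;
             it != end; ++it)
            result.push_back(std::stoul(it->str()));
        return result;
    }

    throw std::invalid_argument("unrecognized format: " + text);
}
```

With these assumptions, parse_items("items 2 through 5") yields 2, 3, 4, 5 and parse_items("items 4 or 8") yields 4, 8. The two patterns are kept separate so that the range case and the list case can be handled differently, which is roughly the point where a grammar-based parser starts to pay off.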
