如何在Flex Lexer中声明和重用角色类？

发布于 2025-01-23 00:54:42 字数 631 浏览 1 评论 0原文

通常，当您想重复使用正则表达式时，可以在声明部分中的FLEX中声明。默认情况下，它们将被括号所包围。例如：

num_seq [0-9]+

%%

{num_seq} return INT;  // will become ([0-9]+)

{num_seq}\.{num_seq} return FLOAT;  // will become ([0-9]+)\.([0-9]+)

但是，我想重复使用一些角色类。我可以定义自定义类，例如[：alpha：]，[：alnum：]等

chars [a-zA-Z]

%%

  // will become (([a-zA-Z]){-}[aeiouAEIOU])+  // ill-formed
  // desired ([a-zA-Z]{-}[aeiouAEIOU])+  // correct
({chars}{-}[aeiouAEIOU])+ return ONLY_CONS;

({chars}{-}[a-z])+ return ONLY_UPPER;

({chars}{-}[A-Z])+ return ONLY_LOWER;

。在他们周围。是否有适当的方法或至关重要的解决方法可以实现这一目标？

原文

Normally, when you want to reuse a regular expression, you can declare it in flex in declaration section. They will get enclosed by parenthesis by default. Eg:

num_seq [0-9]+

%%

{num_seq} return INT;  // will become ([0-9]+)

{num_seq}\.{num_seq} return FLOAT;  // will become ([0-9]+)\.([0-9]+)

But, I wanted to reuse some character classes. Can I define custom classes like [:alpha:], [:alnum:] etc. A toy Eg:

chars [a-zA-Z]

%%

  // will become (([a-zA-Z]){-}[aeiouAEIOU])+  // ill-formed
  // desired ([a-zA-Z]{-}[aeiouAEIOU])+  // correct
({chars}{-}[aeiouAEIOU])+ return ONLY_CONS;

({chars}{-}[a-z])+ return ONLY_UPPER;

({chars}{-}[A-Z])+ return ONLY_LOWER;

But currently, this will fail to compile because of the parenthesis added around them. Is there a proper way or at-least a workaround to achieve this?

分享到QQ

分享到微博