PLY：C 解析器中的令牌转移问题

发布于 2024-07-05 13:35:00 字数 641 浏览 14 评论 0原文

我正在使用 PLY 编写一个 C 解析器，最近遇到了一个问题。此代码：

typedef int my_type;
my_type x;

是正确的 C 代码，因为 my_type 之前被定义为类型被这样使用。我通过在中填充类型符号表来处理它词法分析器使用解析器来区分类型和简单的标识符。

然而，虽然类型声明规则以 SEMI（“;”标记）结尾，但 PLY 在决定第一行完成之前会从第二行转移标记 my_type。因此，我没有机会将类型符号表中的更新传递给词法分析器，它将 my_type 视为标识符而不是类型。

有解决办法吗？

完整代码位于：http://code。 google.com/p/pycparser/source/browse/trunk/src/c_parser.py 不知道如何创建一个更小的例子。

编辑：

问题已解决。请参阅下面我的解决方案。

原文

I'm writing a C parser using PLY, and recently ran into a problem.
This code:

typedef int my_type;
my_type x;

Is correct C code, because my_type is defined as a type previously to
being used as such. I handle it by filling a type symbol table in the
parser that gets used by the lexer to differentiate between types and
simple identifiers.

However, while the type declaration rule ends with SEMI (the ';' token), PLY shifts the token my_type from the second line before deciding it's done with the first one. Because of this, I have no chance to pass the update in the type symbol table to the lexer and it
sees my_type as an identifier and not a type.

Any ideas for a fix ?

The full code is at: http://code.google.com/p/pycparser/source/browse/trunk/src/c_parser.py
Not sure how I can create a smaller example out of this.

Edit:

Problem solved. See my solution below.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

っ〆星空下的拥抱 2024-07-12 13:35:00

不知道为什么你要在词法分析器中进行这种级别的分析。

词法分析可能应该用于将输入流分离为词法标记（数字、换行、关键字等）。解析阶段应该进行该级别的分析，包括 typedef 的表查找等。

这就是我一直在我选择的工具 lexx 和 yacc 之间划分职责的方式。

回复收藏 0 原文

画▽骨i 2024-07-12 13:35:00

使用来自 Dave Beazley（PLY 的创建者）的一些帮助，我的问题得到了解决。

这个想法是使用特殊的子规则并在其中执行操作。就我而言，我将 declaration 规则拆分为：

def p_decl_body(self, p):
    """ decl_body : declaration_specifiers init_declarator_list_opt
    """
    # <<Handle the declaration here>>        

def p_declaration(self, p):
    """ declaration : decl_body SEMI 
    """
    p[0] = p[1]

在 SEMI 移入后，decl_body 始终在令牌之前减少，因此我的操作会在正确的时间执行。

With some help from Dave Beazley (PLY's creator), my problem was solved.

The idea is to use special sub-rules and do the actions in them. In my case, I split the declaration rule to:

def p_decl_body(self, p):
    """ decl_body : declaration_specifiers init_declarator_list_opt
    """
    # <<Handle the declaration here>>        

def p_declaration(self, p):
    """ declaration : decl_body SEMI 
    """
    p[0] = p[1]

decl_body is always reduced before the token after SEMI is shifted in, so my action gets executed at the correct time.

回复收藏 0 原文