Defining variable names that exclude reserved words in a compiler

Posted on 2024-12-09 03:14:41

I am trying to write a lexer for a subset of Java with JavaCC. A variable name can be any combination of letters, digits and _, beginning with a letter. My only problem is that reserved words (such as int, new, ...) cannot be used as variable names, and I was wondering how to declare this. Right now I declare the reserved words first and then the rule for variable names. Is that enough, with the rest left for the parser to deal with?

// Reserved words (declared before TOK_ID)
TOKEN:{
  < TOK_BOOLEAN : "boolean" > |
  < TOK_BREAK : "break" > |
  < TOK_CLASS : "class" >
}

// Variable names: a letter followed by zero or more letters, digits or "_"
TOKEN:{
  < TOK_ID : <LETTER> (<LETTER>|<DIGIT>|"_")* > |
  < #DIGIT : ["0"-"9"] > |
  < #LETTER : ["a"-"z"] | ["A"-"Z"] >
}

TOK_ID is the rule for variable names.
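
To make the expected behaviour concrete, here is a minimal plain-Java sketch of how a lexeme would be classified under JavaCC's matching rule as I understand it: the longest match wins, and when two token definitions match the same text, the one declared earlier in the grammar is chosen. The class name `TokenKindSketch` and the method `kindOf` are made up for illustration and are not part of JavaCC. The sketch classifies one already-separated word rather than scanning a character stream, and it assumes a one-letter name such as `x` should be a valid variable name.

```java
import java.util.List;
import java.util.Locale;
import java.util.regex.Pattern;

public class TokenKindSketch {
    // Reserved words, mirroring the first TOKEN block of the grammar.
    static final List<String> RESERVED = List.of("boolean", "break", "class");

    // Intended identifier shape (assumption): a letter, then zero or more
    // letters, digits or underscores.
    static final Pattern ID = Pattern.compile("[A-Za-z][A-Za-z0-9_]*");

    // Classifies one whole word. A reserved word and the identifier rule
    // match the same text with the same length, so the rule that is
    // "declared first" (the reserved word) is checked first and wins.
    static String kindOf(String lexeme) {
        if (RESERVED.contains(lexeme)) {
            return "TOK_" + lexeme.toUpperCase(Locale.ROOT);
        }
        if (ID.matcher(lexeme).matches()) {
            return "TOK_ID";
        }
        return "NO_MATCH";
    }

    public static void main(String[] args) {
        for (String s : new String[] {"class", "classes", "x", "break_2", "9lives"}) {
            System.out.println(s + " -> " + kindOf(s));
        }
    }
}
```

Under these assumptions `class` is reported as `TOK_CLASS`, while `classes` is reported as `TOK_ID` because the longer text is only matched by the identifier rule. Rejecting a reserved word where a variable name is expected then falls to the parser, which only accepts `TOK_ID` in that position.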

Thank you, and please ask if anything is unclear.
