在 Java 中将文本文件表示为单个单元，并匹配文本中的字符串

发布于 2024-07-18 02:38:31 字数 911 浏览 12 评论 0 原文

如何将文本文件（或 XML 文件）表示为整个字符串，并在其中搜索（或匹配）特定字符串？

我创建了一个 BufferedReader 对象：

BufferedReader input =  new BufferedReader(new FileReader(aFile));

然后我尝试使用 Scanner 类及其选项来指定不同的分隔符，如下所示：

//Scanner scantext = new Scanner(input);
//Scanner scantext = new Scanner(input).useDelimiter("");
Scanner scantext = new Scanner(input).useDelimiter("\n");
while (scantext.hasNext()) {  ... }

使用这样的 Scanner 类我可以逐行或逐字读取文本，但是这对我没有帮助，因为有时在我想要处理的文本中，我

</review><review>

想说：如果您在文本中的任何位置找到“”，请执行以下操作：包含以下下一行（或一段文本）的内容，直到找到“”。问题是和位于文本中的不同位置，有时会粘在其他文本上（因此作为分隔符的空白空间不会别帮我）。

我曾想过我可能会使用Java中的正则表达式API（Pattern和Matcher类），但它们似乎匹配特定的字符串或行，并且我希望将文本作为一个连续的字符串（至少这是我的印象）根据我读到的有关它们的内容）。你能告诉我在这种情况下我应该使用什么结构/方法/类吗？谢谢。

原文

How can I have a text file (or XML file) represented as a whole string, and search for (or match) a particular string in it?

I have created a BufferedReader object:

BufferedReader input =  new BufferedReader(new FileReader(aFile));

and then I have tried to use the Scanner class with its option to specify different delimiters, like this:

//Scanner scantext = new Scanner(input);
//Scanner scantext = new Scanner(input).useDelimiter("");
Scanner scantext = new Scanner(input).useDelimiter("\n");
while (scantext.hasNext()) {  ... }

Using the Scanner class like this I can either read the text line by line, or word by word, but it doesn't help me, because sometimes in the text, which I want to process, I have

</review><review>

and I would like to say: if you find "<review>" anywhere in the text, do something with the following next lines (or piece of text) until you find "</review>". The problem is that <review> and </review> are on different places in the text, and sometimes glued to other text (therefore the empty space as delimiter doesn't help me).

I have thought that I might use the regular expression API in Java (the Pattern and Matcher classes), but they seem to match a particular string or line, and I want to have the text as one continuous string (at least this was my impressions from what I have read about them). Could you tell me what structures/methods/classes I should use in this case? Thank you.

分享到QQ

分享到微博