Java RandomAccessFile - 处理不同的换行符样式？

发布于 2024-08-26 17:24:30 字数 418 浏览 17 评论 0原文

我正在尝试通过 RandomAccessFile 进行查找，作为算法的一部分，我必须读取一行，然后从该行的末尾向后查找，

例如，

String line = raf.readLine();
raf.seek (raf.getFilePointer() - line.length() + m.start() + m.group().length());

//m is a Matcher for regular expressions

我已经收到了大量的离一错误，但无法不明白为什么。我刚刚发现这是因为我正在读取的某些文件具有 UNIX 风格的换行符 \r\n，而有些文件只有 Windows 风格的 \n。

是否有一个简单的方法可以让 RandomAccessFile 将所有换行符视为 Windows 风格的换行符？

原文

I'm trying to seek through a RandomAccessFile, and as part of an algorithm I have to read a line, and then seek backwards from the end of the line

E.g

String line = raf.readLine();
raf.seek (raf.getFilePointer() - line.length() + m.start() + m.group().length());

//m is a Matcher for regular expressions

I've been getting loads of off-by-one errors and couldn't figure out why. I just discovered it's because some files I'm reading from have UNIX-style linefeeds, \r\n, and some have just windows-style \n.

Is there an easy to have the RandomAccessFile treat all linefeeds as windows-style linefeeds?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

总以为 2024-09-02 17:24:30

您始终可以将流备份两个字节并重新读取它们以查看它是 \r \n 还是 (!\r)\n：

String line = raf.readLine();
raf.seek(raf.getFilePointer()-2);
int offset = raf.read() == '\r' ? 2 : 1;
raf.read(); //discard the second character since you know it is either \n or EOF by definition of readLine
raf.seek (raf.getFilePointer() - (line.length()+offset) + m.start() + m.group().length());

我不确定您要放置文件指针的确切位置，因此请调整2/1 常数适当。如果文件中出现空行 (\n\n)，您可能还需要添加额外的检查，就好像它显示您可能会陷入无限循环而没有代码来跳过它一样。

You could always back the stream up two bytes and re-read them to see if it is \r \n or (!\r)\n:

String line = raf.readLine();
raf.seek(raf.getFilePointer()-2);
int offset = raf.read() == '\r' ? 2 : 1;
raf.read(); //discard the second character since you know it is either \n or EOF by definition of readLine
raf.seek (raf.getFilePointer() - (line.length()+offset) + m.start() + m.group().length());

I'm not sure exactly where you are trying to place the file pointer, so adjust the 2/1 constants appropriately. You may also need to add an extra check for blank lines (\n\n) if they occur in your file, as if it shows up you might get stuck in an infinite loop without code to step past it.

回复收藏 0 原文