当前位置：文江博客话题详情

.NET C# - 随机访问文本文件 - 没有简单的方法吗？

发布于 2024-07-09 09:34:43 字数 386 浏览 16 评论 0原文

我有一个文本文件，其中包含多个“记录”。每条记录都包含一个名称和一组数字作为数据。

我正在尝试构建一个类，该类将读取文件，仅显示所有记录的名称，然后允许用户选择他/她想要的记录数据。

第一次浏览文件时，我只读取标头名称，但我可以跟踪标头在文件中的“位置”。我需要随机访问文本文件，以便在用户请求后查找每个记录的开头。

我必须这样做，因为文件太大，无法完全读入内存（1GB+）以及应用程序的其他内存需求。

我尝试使用 .NET StreamReader 类来完成此操作（它提供了非常易于使用的“ReadLine”功能，但无法捕获文件的真实位置（BaseStream 属性中的位置由于类使用的缓冲区）。

在 .NET 中是否没有简单的方法可以做到这一点？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

二智少女 2024-07-16 09:34:43

提供了一些很好的答案，但我找不到一些适用于我非常简单的情况的源代码。就在这里，希望它能节省其他人我花在搜索上的时间。

我指的“非常简单的情况”是：文本编码是固定宽度的，并且行结束字符在整个文件中是相同的。这段代码在我的情况下效果很好（我正在解析一个日志文件，有时我必须在文件中向前查找，然后再回来。我实现的代码足以完成我需要做的事情（例如：只有一个构造函数，并且仅重写 ReadLine())，因此很可能您需要添加代码...但我认为这是一个合理的起点，

public class PositionableStreamReader : StreamReader
{
    public PositionableStreamReader(string path)
        :base(path)
        {}

    private int myLineEndingCharacterLength = Environment.NewLine.Length;
    public int LineEndingCharacterLength
    {
        get { return myLineEndingCharacterLength; }
        set { myLineEndingCharacterLength = value; }
    }

    public override string ReadLine()
    {
        string line = base.ReadLine();
        if (null != line)
            myStreamPosition += line.Length + myLineEndingCharacterLength;
        return line;
    }

    private long myStreamPosition = 0;
    public long Position
    {
        get { return myStreamPosition; }
        set
        {
            myStreamPosition = value;
            this.BaseStream.Position = value;
            this.DiscardBufferedData();
        }
    }
}

以下是如何使用 PositionableStreamReader 的示例：

PositionableStreamReader sr = new PositionableStreamReader("somepath.txt");

// read some lines
while (something)
    sr.ReadLine();

// bookmark the current position
long streamPosition = sr.Position;

// read some lines
while (something)
    sr.ReadLine();

// go back to the bookmarked position
sr.Position = streamPosition;

// read some lines
while (something)
    sr.ReadLine();

There are some good answers provided, but I couldn't find some source code that would work in my very simplistic case. Here it is, with the hope that it'll save someone else the hour that I spent searching around.

The "very simplistic case" that I refer to is: the text encoding is fixed-width, and the line ending characters are the same throughout the file. This code works well in my case (where I'm parsing a log file, and I sometime have to seek ahead in the file, and then come back. I implemented just enough to do what I needed to do (ex: only one constructor, and only override ReadLine()), so most likely you'll need to add code... but I think it's a reasonable starting point.

public class PositionableStreamReader : StreamReader
{
    public PositionableStreamReader(string path)
        :base(path)
        {}

    private int myLineEndingCharacterLength = Environment.NewLine.Length;
    public int LineEndingCharacterLength
    {
        get { return myLineEndingCharacterLength; }
        set { myLineEndingCharacterLength = value; }
    }

    public override string ReadLine()
    {
        string line = base.ReadLine();
        if (null != line)
            myStreamPosition += line.Length + myLineEndingCharacterLength;
        return line;
    }

    private long myStreamPosition = 0;
    public long Position
    {
        get { return myStreamPosition; }
        set
        {
            myStreamPosition = value;
            this.BaseStream.Position = value;
            this.DiscardBufferedData();
        }
    }
}

Here's an example of how to use the PositionableStreamReader:

PositionableStreamReader sr = new PositionableStreamReader("somepath.txt");

// read some lines
while (something)
    sr.ReadLine();

// bookmark the current position
long streamPosition = sr.Position;

// read some lines
while (something)
    sr.ReadLine();

// go back to the bookmarked position
sr.Position = streamPosition;

// read some lines
while (something)
    sr.ReadLine();

回复收藏 0 原文

故人爱我别走 2024-07-16 09:34:43

FileStream有seek()方法。

回复收藏 0 原文

谈下烟灰 2024-07-16 09:34:43

您可以使用 System.IO.FileStream 而不是 StreamReader。如果您确切地知道文件包含什么（例如编码），您可以像使用 StreamReader 一样执行所有操作。

回复收藏 0 原文

三月梨花 2024-07-16 09:34:43

如果您对数据文件的写入方式很灵活并且不介意它对文本编辑器不太友好，则可以使用 BinaryWriter 写入记录：

using (BinaryWriter writer = 
    new BinaryWriter(File.Open("data.txt", FileMode.Create)))
{
    writer.Write("one,1,1,1,1");
    writer.Write("two,2,2,2,2");
    writer.Write("three,3,3,3,3");
}

然后，最初读取每个记录很简单，因为您可以使用 BinaryReader ReadString 方法：

using (BinaryReader reader = new BinaryReader(File.OpenRead("data.txt")))
{
    string line = null;
    long position = reader.BaseStream.Position;
    while (reader.PeekChar() > -1)
    {
        line = reader.ReadString();

        //parse the name out of the line here...

        Console.WriteLine("{0},{1}", position, line);
        position = reader.BaseStream.Position;
    }
}

BinaryReader 没有缓冲，因此您可以获得正确的位置来存储和稍后使用。唯一的麻烦是从行中解析名称，无论如何您可能都必须使用 StreamReader 来完成此操作。

If you're flexible with how the data file is written and don't mind it being a little less text editor-friendly, you could write your records with a BinaryWriter:

using (BinaryWriter writer = 
    new BinaryWriter(File.Open("data.txt", FileMode.Create)))
{
    writer.Write("one,1,1,1,1");
    writer.Write("two,2,2,2,2");
    writer.Write("three,3,3,3,3");
}

Then, initially reading each record is simple because you can use the BinaryReader's ReadString method:

using (BinaryReader reader = new BinaryReader(File.OpenRead("data.txt")))
{
    string line = null;
    long position = reader.BaseStream.Position;
    while (reader.PeekChar() > -1)
    {
        line = reader.ReadString();

        //parse the name out of the line here...

        Console.WriteLine("{0},{1}", position, line);
        position = reader.BaseStream.Position;
    }
}

The BinaryReader isn't buffered so you get the proper position to store and use later. The only hassle is parsing the name out of the line, which you may have to do with a StreamReader anyway.

回复收藏 0 原文

奶茶白久 2024-07-16 09:34:43

编码是固定大小的吗（例如 ASCII 或 UCS-2）？如果是这样，您可以跟踪字符索引（基于您看到的字符数）并根据该索引找到二进制索引。

否则，不 - 您基本上需要编写自己的 StreamReader 实现，它可以让您查看二进制索引。遗憾的是 StreamReader 没有实现这一点，我同意。

回复收藏 0 原文

枯叶蝶 2024-07-16 09:34:43

从 .NET 6 开始，系统中的方法.IO.RandomAccess 类是随机读写文件的官方且受支持的方法。这些 API 与 Microsoft.Win32.SafeHandles.SafeFileHandle 配合使用，可以通过新的 System.IO.File.OpenHandle 函数，也在 .NET 6 中引入。

回复收藏 0 原文