用 C（或只是算法）编写程序来搜索和替换文件中的字符的有效方法是什么？

发布于 2024-10-12 20:39:28 字数 420 浏览 8 评论 0原文

用户将在运行时提供 2 个字符串，例如“asdf”“qwer”，现在，每个出现的“a”都应替换为“q”，“s”替换为“w”，“d”替换为“e”，“f”替换为“r” 字符串的长度可能会有所不同。现在的重点是要操作的文件很大，3-4 TB，所以我们需要一个效率为“n”或“n(log(n))”的高效程序，一系列 if...else不会有帮助。给出的提示是： 1.>该文件没有特殊字符或空格。它仅由小写字符组成 2.> 程序应该利用文件中只有 26 个字符的事实。 3.>最后，使用字符的 ascii 值以某种方式完成解决方案。

其他细节文件应该是关于一个人的论文，所以它不是一个序列。是的，我们必须按顺序读取整个文件，唯一不应该做的就是对每个字符进行比较，即 if(a)then(q)elseif(s)then(w)....something...更有效率？？？

请帮忙

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

许一世地老天荒 2024-10-19 20:39:28

在程序开头创建一个包含 26 个字符的数组。然后替换这个数组中你想要的那些。然后解析整个文件，用表值替换每个字符。

char charsToReplace = "asdf";
char replaceBy = "qwer";
charsToReplaceCount = 4;

char replaceTable[26] = {'a', 'b', 'c', ... , 'z'}

for (int i=0; i<charsToReplaceCount; ++i)
{
    replaceTable[charsToReplace[i] - 'a'] = replaceBy[i];
}

...

for (int i=0; i<fileLengthChunk; ++i)
{
    file[i] = replaceTable[file[i] - 'a'];
}

由于文件很大，我跳过了文件的读写以及分块。

Create an array at the beginning of the program containing 26 characters. Then replace the ones you want in this array. Then parse the whole file replacing every characters with your table values.

char charsToReplace = "asdf";
char replaceBy = "qwer";
charsToReplaceCount = 4;

char replaceTable[26] = {'a', 'b', 'c', ... , 'z'}

for (int i=0; i<charsToReplaceCount; ++i)
{
    replaceTable[charsToReplace[i] - 'a'] = replaceBy[i];
}

...

for (int i=0; i<fileLengthChunk; ++i)
{
    file[i] = replaceTable[file[i] - 'a'];
}

I've skipped read and write of the file as well as the chunking since file is huge.

回复收藏 0 原文

我很坚强 2024-10-19 20:39:28

您首先搜索“要替换”字符串中的第一个字符，一旦找到一个实例，您就开始处理“要替换”字符串，检查每个后续字符，如果找到完全匹配，那么您将替代品。

如果字符串的长度并不总是相同，您将需要读入文件并将修改后的文件写出？我建议这将分块完成，除非您可以在内存中托管 4TB。

基本的伪代码是：

objectstr = "asdf";
targetstr = "qwer";
while not eof
{
   filechar = readchar;
   if (filechar == objectstr[0])
   {
      if (remainingfilechars > length(objectstr)-1)
      {
          match = true;
          for i = 1 to length(objectstr)-1
          {
              filechar = readchar
              if (filechar != objectstr[i])
              {
                  match = false;
                  break;
              }
          }
          if (match)
          {
              writefile(targetstr);
          }
          else
          {
              fileseek(currentfileposition - (length(objectstr)-1));
              writefile(filechar);
          }
       }
    }
    else
    {
        writefile(filechar);
    }
}

You'd start by searching for the first character in the 'to be replaced' string, once you found an instance you start working through your 'to be replaced' string checking each subsequent character, if a complete match is found then you make the replacement.

If the strings are not always the same length, you are going to need to read the file in and write the modified file out? I'd suggest that this would be done in chunks, unless you can host 4TB in memory.

The basic pseudo code would be:

objectstr = "asdf";
targetstr = "qwer";
while not eof
{
   filechar = readchar;
   if (filechar == objectstr[0])
   {
      if (remainingfilechars > length(objectstr)-1)
      {
          match = true;
          for i = 1 to length(objectstr)-1
          {
              filechar = readchar
              if (filechar != objectstr[i])
              {
                  match = false;
                  break;
              }
          }
          if (match)
          {
              writefile(targetstr);
          }
          else
          {
              fileseek(currentfileposition - (length(objectstr)-1));
              writefile(filechar);
          }
       }
    }
    else
    {
        writefile(filechar);
    }
}

回复收藏 0 原文

~没有更多了~