寻找增量备份算法

发布于 2024-12-29 16:34:23 字数 552 浏览 3 评论 0原文

我正在寻找通过 WAN 同步文件和目录的解决方案。

它必须:

  1. 支持基于 Windows 和 Linux 的操作系统
  2. 支持文本和二进制文件
  3. 增量
  4. 恢复损坏的传输
  5. 最好有: 支持级联或分布式备份,以防止单点故障问题。

我已经阅读了 rsync 及其实现。 然而,由于其文件列表解析算法,rsync 可能相对较慢,而且在二进制文件的情况下,“增量”备份在大多数情况下是完整备份,这可能会导致流量过大。 (我知道,在最坏的情况下,无法增量同步二进制文件,只能通过完整备份,就像备份编译产品的文件一样)。 我还在这里检查了几个相关问题(最有用的是这个答案)

所以,我的问题是:有没有(新?)可用于解决上述任务的有用算法/技术/启发法?

任何提示将不胜感激。

I am looking for a solution for files and directories synchronization over WAN.

It must:

  1. support Windows and Linux based OS
  2. support both textual and binary files
  3. be incremental
  4. resume broken transfers
  5. nice-to-have: support cascaded or distributed backups in order to prevent single point of failure problem.

I have read about rsync and its implementations.
However, the rsync could be relatively slow due to its file list parsing algorithm and also in case of binary files the "incremental" backup is a full backup in most cases which can cause heavy traffic. (I am aware of the fact that in the worst case there is no way to sync binary files incrementally but only by full backup like in case of backing up files that are compilation products).
I have also checked several related questions here (the most useful was this answer)

So, my question is: are there any (new?) useful algorithms/techniques/heuristics that can be applied in order to solve the task described above?

Any tips would be appreciated.

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(1

尛丟丟 2025-01-05 16:34:23

我推荐您RMAN备份算法。与传统的增量备份相比,它具有以下优点:

  1. 能够恢复一块数据文件
  2. 能够自动备份 CONTROLFILE 和 SPFILE
  3. 能够自动删除旧的 ARCHIVE REDOLOG 文件,并自动删除新的 ARCHIVE REDOLOG 文件。
  4. 能够并行执行备份和恢复。
  5. 能够报告备份所需的文件。
  6. 能够恢复失败的备份(无需从头开始)。
  7. 与其他传统备份策略相比,速度要快得多。

I recommend you RMAN backup algorithm. It has the following advantages over the traditional incremental backups:

  1. Ability to Recover one block of datafile
  2. Ability to automatically backup CONTROLFILE and SPFILE
  3. Ability to delete the older ARCHIVE REDOLOG files, with the new one's automatically.
  4. Ability to perform backup and restore with parallelism.
  5. Ability to report the files needed for the backup.
  6. Ability to resume the failed backup (without starting from beginning).
  7. Much faster when compared to other TRADITIONAL backup strategies.
~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文