备份 cvs 存储库的最佳实践?
我们的一些项目仍在简历中。 我们目前使用 tar 每晚备份存储库。
这是问题: 备份 cvs 存储库的最佳实践?
背景:我们正在将全国各地的多台服务器合并到一台中央服务器上。 合并的存储库大小为 14GB。 (是的,这个值很高,很可能是由于大量的二进制文件、许多分支和存储库的年龄)。
cvs 存储库的“直接 tar”会生成约 5gb 的 .tar.gz 文件。 从 5gb tar 文件恢复文件将非常困难。 另外,我们很快就填满了磁带。
完整和增量备份方法(即每周完整备份、每晚增量备份)效果如何? 有哪些开源工具可以很好地解决这个问题? (例如阿曼达、巴库拉)。
谢谢,
比尔
Some of our projects are still on cvs. We currently use tar to backup the repository nightly.
Here's the question:
best practice for backing up a cvs repository?
Context: We're combining a several servers across the country onto one central server. The combined repsitory size is 14gb. (yes this is high, most likely due to lots of binary files, many branches, and the age of the repositories).
A 'straight tar' of the cvs repository yields ~5gb .tar.gz file. Restoring files from 5gb tar files will be unwieldy. Plus we fill up tapes quickly.
How well does a full-and-incremental backup approach, i.e. weekly full backup, nightly incremental backups? What open source tools solve this problem well? (e.g. Amanda, Bacula).
thanks,
bill
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
如果您不需要备份历史记录,可以使用 rsync 在另一台计算机上创建存储库的备份副本。 rsync 以增量模式工作,因此仅在发送更改的文件时才会消耗带宽。
我认为您不需要完整的备份历史记录,因为 VCS 提供了自己的历史记录管理,并且您只需要备份作为故障保护措施。
此外,如果您担心备份存储库的一致状态,您可能需要使用文件系统快照,例如 LVM 可以在 Linux 上生成它们。 据我所知,Solaris的ZFS也有快照功能。
当且仅当您在深夜运行备份过程时,当没有人接触您的存储库并且您的 VCS 守护程序在备份期间停止时,您才不需要快照:-)
You can use rsync to create backup copy of your repo on another machine if you don't need history of backups. rsync works in incremental mode, so bandwidth will be consumed only for sending changed files.
I don't think that you need full history of backups as VCS provides its own history management and you need backups ONLY as failure-protection measure.
Moreover, if you worry about consistent state of backed up repository you MAY want to use filesystem snapshots, e.g. LVM can produce them on Linux. As far as I know, ZFS from Solaris also has snapshots feature.
You don't need snapshots if and only if you run backup procedure deeply at night when noone touches your repo and your VCS daemon is stopped during backup :-)
正如 Darkk 提到的,rsync 可以进行良好的备份,因为只复制收费的内容。 Dirvish 是一个基于 rsync 的不错的备份系统。 备份运行速度很快。 恢复非常简单,因为您所要做的就是复制内容。 有效存储多个版本的备份。
As Darkk mentioned rsync makes for good backups since only charged things are copied. Dirvish is a nice backup system based on rsync. Backups run quickly. Restores are extremely simple since all you have to do is copy things. Multiple versions of the backups are store efficiently stored.