当前位置：文江博客话题详情

分布式版本控制系统真的没有集中存储库吗？

发布于 2024-08-26 07:23:26 字数 179 浏览 5 评论 0原文

这似乎是一个愚蠢的问题，但是如何在没有服务器可供检出的情况下设置工作目录呢？企业如何保存存储库的安全备份副本？

我认为必须有一个中央仓库......但是它到底是如何“分布”的？我一直想到服务器-客户端 (SVN) 与点对点 (GIT) 的区别，但我不认为这是正确的，除非像 GIT 这样的工具依赖于 torrent 风格的技术？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

故事还在继续 2024-09-02 07:23:26

分布式版本控制系统真的没有集中存储库吗？

没有强制中央存储库 - 这只是按照惯例。大多数项目确实有一个中央存储库，但每个存储库都是平等的，因为它们具有完整的历史记录，并且可以在彼此之间推送和拉取补丁。

一种思考方式是，集中式 VCS 固定在星型拓扑中：一个中央集线器充当具有完整存储库的服务器，一个或多个客户端挂在其上。客户通常只有最近干净结帐的副本和有限的历史记录（如果有）。因此大多数操作都需要与服务器进行往返。分支是通过在一个存储库中创建分支来实现的。

在分布式 VCS 中，网络拓扑没有限制。理论上你可以拥有任何你喜欢的形状。您可以为每个团队或子项目拥有一个单独的存储库，并进行阶段提交。您可以拥有一个稳定的存储库和一个不稳定的存储库，以及许多功能分支，等等。并且不存在客户端/服务器的区别——所有节点都是平等的。每个存储库都是独立且完整的，并且可以从任何其他存储库推送和/或拉取更改。首先，您克隆现有存储库（制作您自己的副本以供工作），然后开始进行更改。一旦你进行了第一次提交，你实际上就拥有了一个分支。幸运的是，完成后通常很容易将更改合并回来。

但通常发生的情况是，您拥有一个位于中央服务器上的存储库，这使人们更容易上手并跟踪最新更改的位置。

如何在没有服务器可供检出的情况下设置工作目录？

您的存储库必须从源树的某个位置开始。因此，始终存在第一个存储库，以及最初的一系列签入。假设您想在 Murky 上工作。您可以克隆存储库，这将为您提供一个自己的完整存储库，其中包含所有历史记录和签入。您进行一些更改（从而创建分支），完成后，您将更改推回原处，并在其中合并。两个系统都充当对等体，并且它们在彼此之间推送和拉动变更集。

Mercurial 和 Git 都将存储库保存在隐藏的子目录中，因此一个目录树既包含您的工作副本（可以处于您喜欢的任何状态），也包含存储库本身。

企业如何保存存储库的安全备份副本？

如上所述，您只需拥有一个指定的主存储库，其中包含所有最新合并的更改，并像其他任何东西一样对其进行备份。您甚至可以拥有多个备份存储库，或者在物理上独立的盒子上进行自动克隆。在某些方面，备份更容易。

我认为必须有一个中央存储库...但是它到底是如何“分布”的？我一直想到服务器-客户端 (SVN) 与点对点 (GIT) 的区别，但我不认为这是正确的，除非像 GIT 这样的工具依赖于 torrent 风格的技术？

它不是分布式的，因为不同的客户端有不同的部分，例如点对点文件共享。这实际上与中心化模型形成鲜明对比。

所有 DVCS 存储库都是一等公民。如何安排它们成为一个社会或管理问题，而不是一个技术问题。

Does a Distributed Version Control System really have no centralised repository?

There is no enforced central repository - it is only by convention. Most projects do have a central repository, but each repository is equal in the sense that they have the full history, and can push and pull patches between each other.

One way to think of it is a centralised VCS is fixed in a star topology: one central hub acts as the server with the complete repository, with one or more clients hanging off it. The clients typically only have a copy of the most recent clean checkout, and limited history (if any). So most operations require a round-trip to the server. Branching is achieved by creating branches within the one repository.

In a distributed VCS, there is no limit to the topology of your network. You can theoretically have any shape you like. You can have a separate repository per team or sub-project, and stage commits. You can have a stable repository and an unstable repository, and lots of feature branches, and so on. And there is no client/server distinction - all nodes are equal. Each repository is self-contained and complete, and can push and/or pull changes from any other. To get started, you clone an existing repository (make your own copy to work from), and start making changes. Once you make your first commit, you effectively have a branch. Fortunately, it is usually very easy to merge your changes back when you're done.

But what normally happens is you have one repository which is on a central server, which makes it easier for people to get started, and to keep track of where the latest changes are.

how do you get a working drectory set up without a server to check out from?

Your repository has to start somewhere with a source tree. So there is always a first repository, with the initial series of checkins. Let's say you want to work on Murky. You would clone the repository, which gives you a complete repository of your own, with all the history and checkins. You make some changes (thus creating a branch), and when you're done, you push your changes back, where they get merged. Both systems are acting as peers, and they push and pull changesets between each other.

Both Mercurial and Git keep the repository in a hidden subdirectory, so the one directory tree contains both your working copy (which can be in whatever state you like), and the repo itself.

And how does a business keep a safe backed up copy of the repo?

As above, you simply have a nominated master repository which has all the latest merged changes, and back it up like anything else. You can even have multiple backup repos, or have automated clones on physically separate boxes. In some ways, backing up is easier.

I assume then there must be a central repo... but then how exactly is it 'distributed'? I always thought of a server-client (SVN) Vs peer-2-peer (GIT) distinction, but I don't believe that can be correct unless tools like GIT are dependent on torrent-style technology?

It is not distributed in the sense that different clients have different parts, like peer-to-peer file sharing. It is really just in contrast to the centralised model.

All DVCS repositories are first-class citizens. It becomes a social or managerial question of how to arrange them, rather than a technical issue.

回复收藏 0 原文

怕倦 2024-09-02 07:23:26

回复：“洪流式技术”-您混淆了两个问题，一个是网络拓扑（点对点与服务器/客户端），另一个是服务器权限。这是可以理解的，因为术语几乎相同。但是分布式源代码控制对网络连接模型没有任何要求 - 如果您愿意，您可以通过电子邮件分发变更集。分布式版本控制的重要之处在于，每个人本质上都运行自己的服务器并合并来自其他服务器的更改。当然，您需要能够从某个地方获取初始克隆，以及如何知道“某个地方”在哪里超出了系统本身的范围。没有“跟踪器”程序或任何东西——通常有人在某个地方有一个公共存储库，其地址发布在网站上。但是一旦你克隆了它，你的副本就是一个完整的副本，可以作为其他人克隆的基础。

回复收藏 0 原文