当前位置：文江博客话题详情

“分布式计算”是如何实现的？适用于网络开发或一般编程吗？

发布于 2024-10-08 00:14:42 字数 337 浏览 0 评论 0原文

我即将使用 Apache Hadoop，标题如下：

Apache Hadoop 项目开发开源软件可靠，可扩展的分布式计算。

我可以将“可扩展性”与编程联系起来，但我只是不知道这种“分布式”如何帮助我的开发。根据维基百科：

分布式系统包括多台自治计算机通过计算机进行通信网络。计算机与计算机交互彼此为了达到一个目的共同目标

那么这是否意味着我可以在多台计算机上部署我的网络应用程序并进行某种“密集计算”？我想到的术语是内容交付网络和云计算。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

爱给你人给你 2024-10-15 00:14:42

Web 开发一直与分布式计算有关，因为客户端与其对话的服务器位于不同的计算机上，网页可以从许多服务器中提取资源来构建页面的内容，并且服务器可以与其他计算机对话以实现其目标。 CDN 使这一点比以前更加明显，但实际上它们只是一种演变，在您所要求的内容和用于提供它的硬件之间引入了虚拟化/间接层。

云是采用虚拟化的概念并将其应用于远程托管，包括低级操作系统和高级软件平台。它们真正有趣的是，这使得客户能够实现不同的业务模型（并且也具有不同的风险，但这主要与它是分布式计算这一事实无关，而是与它并不完全在您自己的控制之下）管辖权）。

我发现分布式计算最有效的使用是当你考虑将不同的服务连接在一起时，每个服务都具有不同的功能（这可能是出于技术原因，也可能不是；有时，这是出于商业或法律原因事物必须被划分），并且其中每项服务都可以由多个位置的许多组件提供。在通用能力图的整体背景下平衡性能需求（将组件聚集在一起的力量）和稳健性需求（这往往导致分布和复制）存在并且仍然存在问题。

天哪！那一段听起来就像是可怕的废话！我想说的是，这都是权衡的结果，你应该做好第一次就做不好的准备。

（Hadoop 是一种用于进行分布式文件存储以及在整个数据集上高效应用某些操作类别的机制，这些操作非常适合 MapReduce 或其他类似的分散收集算法。如果那只鞋适合，就使用它。但它并不能解决所有问题，谢天谢地！可以做所有事情的东西往往看起来非常像实际上根本不能做任何事情，并且有用性和可理解性来自于限制。）

Web development has always been about distributed computing, since clients have been on different machines to the servers they talk to, web pages can pull in resources from many servers to build a page's content, and servers may talk to other machines to achieve their goals. CDNs make this more obvious than before, but really they're just an evolution, an introduction of a virtualization/indirection layer between what you ask for and the hardware used to provide it.

Clouds are about taking the concepts of virtualization and applying them to remote hosting, both of low-level OSes and higher-level software platforms. The really interesting thing about them is that this enables different business models on the part of customers (and with different risks too, but that's mostly not related to the fact that it's distributed computing but rather that it is not wholly under your control in your own jurisdiction).

I've found that the most effective use of distributed computing is when you think in terms of connecting together distinct services, each of which with different capabilities (which might be for technical reasons, or might not; sometimes, it's for business or legal reasons that things have to be divided up) and where each of those services may be provided by many components in multiple locations. There are, and continue to remain, issues with balancing the need for performance (which is a force that brings components together) and the need for robustness (which tends to lead to distribution and replication) within the overall context of the general capabilities map.

My goodness! That paragraph sounds like terrible piffle! What I'm trying to say is that it's all trade-offs, and you should be prepared for not getting it right first time.

(Hadoop is a mechanism for doing a distributed file store, and for efficiently applying certain classes of operation – those that fit well with MapReduce or other similar scatter-gather algorithms – across that whole dataset. If that shoe fits, use it. But it doesn't solve all problems, and thank goodness for that! Things that can do everything tend to look very much like things that can't actually do anything at all, and usefulness and comprehensibility come in the restrictions.)

回复收藏 0 原文