有没有可扩展的开源文件存储系统?
有趣的是,哪些软件正在使用 dropbox、rapidshare、hotfile 等公司来管理大量文件?有没有开源的系统。我查看了基于 mongo-db 构建的 gridFS。考虑到速度(与 nginx 相比),这似乎不是最佳选择。还是我错了?
我想要可以无限扩展的系统。通过将服务器插入系统。列表中为 100 TB。
It is interesting what software are using companies like dropbox, rapidshare, hotfile, and other to manage huge amount of files? Is there any open source system. I took a look on gridFS which is build on mongo-db. It looks like this is not the best choice because of speed (comparing to nginx). Or am i wrong?
I want system which can scale infinitely. By plugging servers in to the system. At list to 100 TB.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(6)
查看 MogileFS - http://danga.com/mogilefs/ - 由 Danga Interactive 开发的开源文件系统,用于使用通过他们的 LiveJournal.com 服务。
如果您不想/不能使用 Amazon S3 等云服务并运行自己的服务器,那么 MogileFS 可能是正确的选择。 MogileFS 可靠,管理和扩展非常简单且便宜。
不幸的是,我无法提供与其他文件系统的任何性能比较或基准。但您不应该期望 MogileFS 排名第一,因为它在应用程序级别上工作,这也可能是一个优势。
另请参阅 Google 代码页了解详细信息:http://code.google.com/p/mogilefs/
Checkout MogileFS - http://danga.com/mogilefs/ - an open source filesystem developed by Danga Interactive for using with their LiveJournal.com services.
If you don't want to/cannot use cloud services like Amazon's S3 and run your own servers, than MogileFS might be the right choice. MogileFS is reliable, management and scaling is very easy and cheap.
Unfortunately, I cannot provide any performance comparison or benchmarks to other filesystems. But you shouldn't expect MogileFS for the 1st place, cause it's working on application level, which can also be an advantage.
See also the Google Code page for more infos: http://code.google.com/p/mogilefs/
Dropbox 基于 Amazon 的 S3 Dropbox - 我的文件存储在哪里。
您可以在这里找到一些开源选项:Amazon S3 的替代品
Dropbox is built on Amazon's S3 Dropbox - Where are my files stored.
You can find some open source options over here: Alternatives to Amazon S3
Hadoop 的 HDFS 是一个可扩展的文件系统。另一个选择是 GlusterFS
Hadoop's HDFS is a scalable file system. Another option is GlusterFS
GlusterFs
是一个开源分布式文件系统。与 HDFS 不同的是,它没有集中的元数据。这意味着 GlusterFS 没有单点故障。GlusterFs
is an open source distributed file system. UnlikeHDFS
it doesn't have a centralized metadata. Which meansGlusterFS
has no single point of failure.对于发现这个问题的 googler:
FB 有太多文件,他们必须编写自己的文件系统;多个文件基本上存储在 1 个巨大的文件中,文件之间有标记。这样做是为了减少系统中的文件数量。
您需要的似乎是 Big Table 的 Google 文件系统
For the googler out there who finds this question:
FB has so many files that they had to write their own file system; multiple files are basically stored into 1 giant file with markers between them. This is done in order to reduce the number of files in the system.
What you need seems like Big Table's Google File System
gridFS 有容量限制是什么意思?您能否更具体一些或指出您阅读的一些文档,因为我不知道 gridFS 的任何容量限制。
What do you mean by gridFS has capacity limits ? Can you please be more specific or point to some documentation where you read that since I am not aware of any capacity limits with gridFS.