具有共享 S3 存储的 Amazon EC2 Windows AMI
我目前有一个在 Amazon EC2 上创建的基本 Windows 2008 Server AMI。我使用它一次创建 20-30 个基于 EBS 的 EC2 实例,以便为客户将大量数据处理为 PDF。然而,一旦数据处理完成,我必须手动连接到每台机器并复制文件。这需要花费大量的时间和精力,因此我试图找出使用 S3 作为输出的 PDF 文件的集中存储的最佳方法。
我见过许多第三方(商业)实用程序可以将 S3 存储桶映射到 Windows 中的驱动器,但是有没有更好、更明智的方法来实现我想要的功能?之前没有使用过 S3,只使用过 EC2,我不确定有哪些选项可用,而且我也无法在网上找到任何解决使用 S3 作为多个 EC2 Windows 实例的集中存储问题的信息。
更新:感谢使用S3的命令行工具的建议。希望有一些更加综合性和更少临时性的东西。鉴于 EC2 与 S3 密切相关(S3 曾经是 AMI 等的默认存储机制),我可能可以做一些更简洁/更简单的事情。也许甚至围绕私有云网络和 EC2 支持的 S3 服务器等,或者其他什么(我对此一无所知的领域)。没有其他想法吗?
I've currently got a base Windows 2008 Server AMI that I created on Amazon EC2. I use it to create 20-30 EBS-based EC2 instances at a time for processing large amounts of data into PDFs for a client. However, once the data processing is complete, I have to manually connect to each machine and copy off the files. This takes a lot of time and effort, and so I'm trying to figure out the best way to use S3 as a centralised storage for the outputted PDF files.
I've seen a number of third party (commercial) utilities that can map S3 buckets to drives within Windows, but is there a better, more sensible way to achieve what I want? Having not used S3 before, only EC2, I'm not sure of what options are available, and I've not been able to find anything online addressing the issue of using S3 as centralised storage for multiple EC2 Windows instances.
Update: Thanks for suggestions of command line tools for using S3. Was hoping for something a little more integrated and less ad-hoc. Seeing as EC2 is closely related to S3 (S3 used to be the default storage mechanism for AMIs, etc), that there might be something neater/easier I could do. Perhaps even around Private Cloud Networks and EC2 backed S3 servers, etc, or something (an area I know nothing about). No other ideas?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
您可以使用带有通过 samba 导出的 EBS 的 EC2 实例,该实例可以充当 Windows 实例可以映射的集中存储吗?
You could use an EC2 instance with EBS exported through samba which can act as a centralized storage that windows instances can map?
对我来说,这听起来非常像 hadoop/Amazon MapReduce 作业。不幸的是,hadoop 最好部署在 Linux 上:
Windows 服务器上的 Hadoop
我假设您使用的软件pdf处理仅适用于Windows吗?
如果不是这种情况,我会认真考虑将您的解决方案移植到 Linux。
this sounds very much like a hadoop/Amazon MapReduce job to me. Unfortunately, hadoop is best deployed on Linux:
Hadoop on windows server
I assume the software you use for pdf-processing is Windows only?
If this is not the case, I'd seriously consider porting your solution to Linux.
我可能会寻找命令行工具。在 Google 上快速搜索后,我找到了一个 .Net 工具:
http://s3.codeplex.com/
并且Java 版本:
http://www.beaconhill.com/opensource/s3cp.html
我确信还有其他人也在那里。
I'd probably look for a command line tool. A quick search on Google lead me to a .Net tool:
http://s3.codeplex.com/
And a Java one:
http://www.beaconhill.com/opensource/s3cp.html
I'm sure there are others out there as well.