How to host a persistent data storage system on Amazon
I am working on a project that deals with a large amount of data, and I am thinking of hosting it on EC2. I intend to use Hadoop for the computation and a NoSQL system (e.g. HBase or Cassandra) to store the data. The NoSQL system must be persistent (I don't want to lose my data). As far as I know, I need to spawn VMs to host Hadoop and the NoSQL system, but the VMs are not persistent. Is there another way to host the data storage system persistently (not only the data, but also the system that manages it) and still make use of the computation Amazon provides?
I guess my scenario is similar to that of people who host their databases persistently.
Comments (2)
I think you need to look at using "Reserved Instances" and "Elastic Block Store" (EBS).
http://aws.amazon.com/ec2/reserved-instances/
http://aws.amazon.com/ebs/
If I understand your question correctly, you would want a reserved instance that you always leave running, attached to an EBS volume for persistent storage of your data. EBS can also make backup "snapshots" to S3.
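As a rough illustration of the EBS approach described above, here is a minimal Python sketch. It assumes `ec2` is a boto3 EC2 client (for example `boto3.client("ec2")`); the instance ID, availability zone, volume size, volume type, and device name are hypothetical placeholders and not something taken from the question. The helper function names are also made up for this example.

```python
# Sketch only: `ec2` is assumed to be a boto3 EC2 client, e.g. boto3.client("ec2").
# All IDs, zones, sizes, and device names passed in are hypothetical.


def create_and_attach_volume(ec2, instance_id, zone, size_gib, device="/dev/sdf"):
    """Create an EBS volume and attach it to an existing instance."""
    volume = ec2.create_volume(
        AvailabilityZone=zone,  # must be the same zone as the instance
        Size=size_gib,          # size in GiB
        VolumeType="gp3",       # assumed volume type, adjust as needed
    )
    volume_id = volume["VolumeId"]

    # A new volume has to reach the "available" state before it can be attached.
    ec2.get_waiter("volume_available").wait(VolumeIds=[volume_id])

    ec2.attach_volume(VolumeId=volume_id, InstanceId=instance_id, Device=device)
    return volume_id


def snapshot_volume(ec2, volume_id, description):
    """Start a point-in-time snapshot of the volume and return its ID."""
    snapshot = ec2.create_snapshot(VolumeId=volume_id, Description=description)
    return snapshot["SnapshotId"]
```

With real credentials this might be used as `vol = create_and_attach_volume(boto3.client("ec2"), "i-0123456789abcdef0", "us-east-1a", 100)` followed by `snapshot_volume(ec2, vol, "nightly backup")`. The point of the design is that the volume lives independently of the instance, so the HBase or Cassandra data directory placed on it should survive the instance being stopped or replaced.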
Amazon provides a service named SimpleDB that you can use to store data persistently and flexibly. Depending on the requirements of your data, you might also be able to use Amazon S3.
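For the S3 option, a minimal sketch of storing and reading back an object might look like the following. It assumes `s3` is a boto3 S3 client (for example `boto3.client("s3")`); the bucket and key names are hypothetical, and the two helper functions are invented for illustration only.

```python
# Sketch only: `s3` is assumed to be a boto3 S3 client, e.g. boto3.client("s3").
# Bucket and key names passed in are hypothetical.


def save_blob(s3, bucket, key, data):
    """Store a bytes payload under the given bucket and key."""
    s3.put_object(Bucket=bucket, Key=key, Body=data)


def load_blob(s3, bucket, key):
    """Fetch the object and return its contents as bytes."""
    response = s3.get_object(Bucket=bucket, Key=key)
    return response["Body"].read()
```

S3 stores whole objects rather than rows, so this would suit things like input files or periodic exports more than the live storage of a database.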