在 .NET 云中运行分布式计算的框架
我正在考虑开发一个框架来简化在Windows Azure的.NET云环境中运行分布式计算。
目前,Azure(很可能在发布时)完全不适合在云中简单运行分布式查询(详细信息)。 简单对我来说就像DryadLINQ,其中您可以编写一个查询:
var results = from c in collection
where IsLegal(c.Key)
select new
{
Key = Hash(c.Key),
Result = RunModel(c.Value);
};
并让它在集群中的多台机器上远程执行。 无需费心部署、存储或配置。
您可以建议哪些资源、论文或开源项目来检查有关该主题的其他信息(尤其是调度和 DAG 优化)?
我一直在研究 Hadoop(由 Amazon Elastic Map Reduce 使用)和 DryadLINQ 到目前为止。 显然这包括谷歌搜索。
I'm thinking about developing a framework to simplify running distributed computations in .NET cloud environment of Windows Azure.
Azure currently (and by the time of the release, most likely) is completely unsuited for simple running of distributed queries in the cloud (details). Simple for me is something like DryadLINQ where you can write a query:
var results = from c in collection
where IsLegal(c.Key)
select new
{
Key = Hash(c.Key),
Result = RunModel(c.Value);
};
and have it executed remotely over multiple machines in the cluster. No deployments, storages or configurations to bother with.
What resources, papers or open source projects could you advise to check for additional information on the subject (esp. scheduling and DAG optimization)?
I've been digging around Hadoop (used by Amazon Elastic Map Reduce) & DryadLINQ so far. Obviously this includes Googling.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
Google 研究上有很多有趣的论文。
如果您还没有阅读过,MapReduce 论文可能是一个不错的起点还没完成。
There are a lot of interesting papers at Google Research.
The MapReduce Paper might be a good place to start if you haven't read it yet.