Data Vault 模型何时适合数据仓库模型?
我最近发现了对“数据仓库建模”作为数据仓库模型的引用。我之前见过的模特是Inmon和Kimball。作者提到了由于需要连接而可能出现的性能问题。它看起来是一个不错的模型,但我想知道其中的陷阱。网上有经验报告吗?
I recently found a reference to 'Data Vault Modeling' as a model for data-warehouses. The models I've seen before are Inmon and Kimball. The author refers to possible performance problems due to the joins needed. It looks like a nice model, but I wonder about the gotcha's. Are there any experience reports on-line?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
多年来,我们一直在使用 Data Vault 的自行修改,称为“ Link Modelling',只有实体和链接;从 Neo4j 汲取原理,但在 SQL 数据库中实现。
链接建模和数据仓库都是与 Kimball/Inmon 模型截然不同的思维方式。
我下面的评论涉及使用以下结构构建的系统:一个临时暂存数据库、一个 DWH,然后从 DWH 构建多个集市。还有其他方法来构建 DWH 解决方案,但这非常典型。
使用 Kimball/Inmon
通过数据仓库/链接建模
链接建模 DWH 发挥作用的场合如下(使用 Kimball 术语来表达问题)
在金博尔方法论中,总有办法解决这些问题,但它们需要仔细思考,有时还需要一些突破。
从我们的角度来看,链接建模几乎没有什么缺点。
我与任何营销/生产 Kimball/Inmon 或 Data Vault 方法的公司都没有关系。
We have been using a home-grown modification to Data Vault for a number of years, called 'Link Modelling', which only has entities and links; drawing principles from neo4j, but implementing in a SQL database.
Both Link Modelling and Data Vault are very different ways of thinking to Kimball/Inmon models.
My comments below relate to a system built with the follow structure: a temporary staging database, a DWH, then a number of marts build from the DWH. There are other ways to architect a DWH solution, but this is quite typical.
With Kimball/Inmon
With Data Vault/Link Modelling
The occasions where a Link Modelled DWH comes into its own, are the following (using Kimball terminology to express the issues)
There are always ways around these in Kimball methodology, but they require some careful thought and sometimes some jumping through hoops.
From our perspective, there is little downside to the Link Modelling.
I am not connected with any of the companies marketing/producing Kimball/Inmon or Data Vault methodologies.
您可以在我的博客上找到更多信息:http://danLinstedt.com 以及 datavaultinstitute 的论坛但是
为了给您一个快速/简短的答案:
问题如下:
1)必须接受将原始数据加载到数据仓库的概念
2) 了解由于模型原因,Data Vault 通常不允许“最终用户”直接访问。
可能还有更多,但利大于弊。
请随意查看博客,免费注册/关注。
干杯,
丹·林斯特
You can find a whole lot more information on my blog: http://danLinstedt.com, and on the forums at datavaultinstitute dot com
But to give you a quick/brief answer to your question:
The gotchas are as follows:
1) Have to accept the concept of loading raw data to the data warehouse
2) Understand that the Data Vault usually doesn't allow "end-users" direct access because of the model.
There may be a few more, but the benefits outweigh the drawbacks.
Feel free to check out the blog, it's free to register/follow.
Cheers,
Dan Linstedt