Why do we need a temporal database?

Posted on 2024-07-17 22:40:21

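I am reading about temporal databases, and they seem to have the time aspect built in. Why do we need such a model?

How is it different from a normal RDBMS? Can't we use a normal database (i.e. an RDBMS) with triggers that associate a timestamp with each transaction that occurs? Performance might suffer, but I am still skeptical that there is a strong case for temporal databases in the market.

Do any current databases support such a feature?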

烟织青萝梦 2024-07-24 22:40:21

Consider your appointment/journal diary - it goes from Jan 1st to Dec 31st. Now we can query the diary for appointments/journal entries on any day. This ordering is called the valid time. However, appointments/entries are not usually inserted in order.

Suppose I would like to know what appointments/entries were in my diary on April 4th. That is, all the records that existed in my diary on April 4th. This is the transaction time.

Appointments/entries can be created, deleted and so on, so a typical record has a beginning and end valid time that covers the period of the entry, and a beginning and end transaction time that indicates the period during which the entry appeared in the diary.

This arrangement is necessary when the diary may undergo historical revision. Suppose on April 5th I realise that the appointment I had on Feb 14th actually occurred on February 12th, i.e. I discover an error in my diary. I can correct the error so that the valid time picture is corrected, but now my query of what was in the diary on April 4th would be wrong, unless the transaction times for appointments/entries are also stored. In that case, if I query my diary as of April 4th it will show that an appointment existed on February 14th, but if I query as of April 6th it will show an appointment on February 12th.
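The diary correction above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the year (2024), the date the entry was first written (Feb 10th), and every name in the code are made up, and valid time is reduced to a single day rather than a begin/end period to keep the sketch short.

```python
from dataclasses import dataclass
from datetime import date

# Assumption: the year 2024 and all names below are illustrative only.
FOREVER = date.max  # stands in for "this row is still current"


@dataclass
class DiaryRow:
    description: str
    valid_on: date          # valid time: the day the appointment took place
    tx_from: date           # transaction time: first day the row was in the diary
    tx_to: date = FOREVER   # transaction time: day the row was superseded (exclusive)


def diary_as_of(rows, as_of):
    """Return the rows the diary contained on the given day."""
    return [r for r in rows if r.tx_from <= as_of < r.tx_to]


def correct(rows, old, new_valid_on, today):
    """Correct an entry without losing history: close the old row, append a new one."""
    old.tx_to = today
    rows.append(DiaryRow(old.description, new_valid_on, tx_from=today))


# The entry is first written on Feb 10th (assumed) and corrected on April 5th.
rows = [DiaryRow("appointment", date(2024, 2, 14), tx_from=date(2024, 2, 10))]
correct(rows, rows[0], date(2024, 2, 12), today=date(2024, 4, 5))

print([r.valid_on.isoformat() for r in diary_as_of(rows, date(2024, 4, 4))])  # ['2024-02-14']
print([r.valid_on.isoformat() for r in diary_as_of(rows, date(2024, 4, 6))])  # ['2024-02-12']
```

Nothing is overwritten or deleted here: the correction only closes the transaction period of the old row, which is what keeps the April 4th view reproducible.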

This time travel feature of a temporal database makes it possible to record information about how errors are corrected in a database. This is necessary for a true audit picture of data that records when revisions were made and allows queries relating to how data have been revised over time.

Most business information should be stored in this bitemporal scheme in order to provide a true audit record and to maximise business intelligence, hence the need for support in a relational database. Notice that each data item occupies a (possibly unbounded) rectangle in the two-dimensional time model, which is why people often use a GiST index to implement bitemporal indexing. The problem here is that GiST indexes are mostly used for geometric and geographic data, and the requirements for temporal data are somewhat different.

PostgreSQL 9.0 exclusion constraints should provide new ways of organising temporal data e.g. transaction and valid time PERIODs should not overlap for the same tuple.
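To make the non-overlap rule concrete, here is a small Python sketch of the check that such a constraint enforces. It is not PostgreSQL syntax and does not use any PostgreSQL API; the class and function names are hypothetical, and periods are assumed to be half-open, [start, end).

```python
from datetime import date


def overlaps(a_start, a_end, b_start, b_end):
    """Half-open periods [start, end) overlap when each starts before the other ends."""
    return a_start < b_end and b_start < a_end


class PeriodTable:
    """Hypothetical in-memory table that rejects overlapping periods per key."""

    def __init__(self):
        self.rows = []  # (key, start, end)

    def insert(self, key, start, end):
        if start >= end:
            raise ValueError("period must not be empty")
        for other_key, other_start, other_end in self.rows:
            if other_key == key and overlaps(start, end, other_start, other_end):
                raise ValueError(f"period overlaps an existing row for {key}")
        self.rows.append((key, start, end))


table = PeriodTable()
table.insert("entry-1", date(2024, 1, 1), date(2024, 2, 1))
table.insert("entry-1", date(2024, 2, 1), date(2024, 3, 1))   # adjacent, so allowed
table.insert("entry-2", date(2024, 1, 15), date(2024, 2, 15))  # different key, so allowed

try:
    table.insert("entry-1", date(2024, 1, 15), date(2024, 2, 15))
    rejected = False
except ValueError:
    rejected = True  # overlaps both existing periods for entry-1
```

A database-level constraint does the same job declaratively and with index support, instead of scanning every row as this sketch does.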

你与清晨阳光 2024-07-24 22:40:21

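Temporal databases typically store time series efficiently by fixing a time scale (such as seconds or even milliseconds) and then storing only the changes in the measured data. In an RDBMS, a timestamp is stored as a separate value for every measurement, which is very inefficient. Temporal databases are often used in real-time monitoring applications such as SCADA. The PI database from OSISoft (http://www.osisoft.com/) is a well-established system.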
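As a rough sketch of the "store only the changes" idea, assuming one reading per tick of a fixed time scale, the Python below keeps a reading only when it differs from the previous one. The function names and sample values are invented for illustration, and this is not how the PI database or any particular product works internally.

```python
from bisect import bisect_right


def compress(samples):
    """Keep only the (tick, value) pairs where the value changes.

    `samples` holds one reading per tick of a fixed time scale, so the
    tick index doubles as the timestamp.
    """
    changes = []
    for tick, value in enumerate(samples):
        if not changes or changes[-1][1] != value:
            changes.append((tick, value))
    return changes


def value_at(changes, tick):
    """Return the reading at a tick: the last change at or before that tick."""
    ticks = [t for t, _ in changes]
    index = bisect_right(ticks, tick) - 1
    if index < 0:
        raise LookupError("no reading at or before this tick")
    return changes[index][1]


samples = [20.0, 20.0, 20.0, 20.5, 20.5, 21.0, 21.0, 21.0]
changes = compress(samples)
print(changes)               # [(0, 20.0), (3, 20.5), (5, 21.0)]
print(value_at(changes, 4))  # 20.5
```

Eight readings shrink to three stored pairs, and any tick can still be answered by looking up the most recent change.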

七色彩虹 2024-07-24 22:40:21

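As I understand it (and over-simplifying), a temporal database records facts about when the data was valid as well as the data itself, and lets you query the temporal aspects. You end up dealing with "valid time" and "transaction time" tables, or "bitemporal tables" that involve both the "valid time" and "transaction time" aspects. You should consider reading either of these two books:

Darwen, Date and Lorentzos, "Temporal Data and the Relational Model" (out of print)
and (at a completely different extreme) "Developing Time-Oriented Database Applications in SQL", Richard T. Snodgrass, Morgan Kaufmann Publishers, Inc., San Francisco, July 1999, 504+xxiii pages, ISBN 1-55860-436-7. It is out of print but is available on his website at cs.arizona.edu (so a Google search finds it easily).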

天涯离梦残月幽梦 2024-07-24 22:40:21

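Temporal databases are often used in the financial services industry. One reason is that you are rarely (if ever) allowed to delete any data, so ValidFrom - ValidTo type fields on records are used to indicate when a record was correct.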
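A minimal Python sketch of the ValidFrom/ValidTo pattern follows. The customer, addresses, dates and function names are all hypothetical; the point is only that an update closes the current row instead of deleting it, so a question about a past date can still be answered.

```python
from datetime import date

OPEN_ENDED = date.max  # assumption: marks a row that is still valid

history = []  # each row: {"customer", "address", "valid_from", "valid_to"}


def set_address(customer, address, effective):
    """Record a new address without deleting the old one."""
    for row in history:
        if row["customer"] == customer and row["valid_to"] == OPEN_ENDED:
            row["valid_to"] = effective  # close the current row
    history.append({
        "customer": customer,
        "address": address,
        "valid_from": effective,
        "valid_to": OPEN_ENDED,
    })


def address_on(customer, day):
    """Return the address that was valid on the given day, or None."""
    for row in history:
        if row["customer"] == customer and row["valid_from"] <= day < row["valid_to"]:
            return row["address"]
    return None


set_address("acme", "1 Old Road", date(1999, 1, 1))
set_address("acme", "9 New Street", date(2005, 6, 1))

print(address_on("acme", date(2001, 3, 15)))  # 1 Old Road
print(address_on("acme", date(2024, 1, 1)))   # 9 New Street
```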

昵称有卵用 2024-07-24 22:40:21

Besides "what new things can I do with it", it might be useful to consider "what old things does it unify?". The temporal database represents a particular generalization of the "normal" SQL database. As such, it may give you a unified solution to problems that previously appeared unrelated. For example:

Web Concurrency: When your database has a web UI that lets multiple users perform standard Create/Update/Delete (CRUD) modifications, you have to face the concurrent web changes problem. Basically, you need to check that an incoming data modification is not affecting any records that have changed since that user last saw those records. But if you have a temporal database, it quite possibly already associates something like a "revision ID" with each record (due to the difficulty of making timestamps unique and monotonically ascending). If so, then that becomes the natural, "already built-in" mechanism for preventing the clobbering of other users' data during database updates.
Legal/Tax Records: The legal system (including taxes) places rather more emphasis on historical data than most programmers do. Thus, you will often find advice about schemas for invoices and such that warns you to beware of deleting records or normalizing in a natural way, which can lead to an inability to answer basic legal questions like "Forget their current address, what address did you mail this invoice to in 2001?" With a temporal framework base, all the machinations around those problems (they usually are halfway steps to having a temporal database) go away. You just use the most natural schema, and delete when it makes sense, knowing that you can always go back and answer historical questions accurately.
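The revision ID check described under Web Concurrency can be sketched as follows. This is a hedged illustration, not the API of any real temporal database: `RevisionedStore`, `StaleUpdateError` and the keys are hypothetical, and the store keeps only the latest value, so it shows the concurrency check alone rather than full history.

```python
class StaleUpdateError(Exception):
    """Raised when a write is based on a revision that is no longer current."""


class RevisionedStore:
    def __init__(self):
        self._rows = {}          # key -> (revision, value)
        self._next_revision = 1  # a counter is unique and monotonic, unlike timestamps

    def read(self, key):
        """Return (revision, value) so the caller can remember what it saw."""
        return self._rows[key]

    def write(self, key, value, seen_revision=None):
        """Store a new value only if the caller saw the latest revision."""
        current = self._rows.get(key)
        current_revision = current[0] if current else None
        if current_revision != seen_revision:
            raise StaleUpdateError(f"{key} changed since revision {seen_revision}")
        revision = self._next_revision
        self._next_revision += 1
        self._rows[key] = (revision, value)
        return revision


store = RevisionedStore()
store.write("invoice-1", "draft")

# Two users load the same record and see the same revision.
alice_seen, _ = store.read("invoice-1")
bob_seen, _ = store.read("invoice-1")

store.write("invoice-1", "Alice's edit", seen_revision=alice_seen)
try:
    store.write("invoice-1", "Bob's edit", seen_revision=bob_seen)
    bob_succeeded = True
except StaleUpdateError:
    bob_succeeded = False  # Bob must re-read the record before retrying
```

Bob's write is rejected because the record moved on after he read it, so Alice's change is not silently overwritten.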

On the other hand, the temporal model itself is half-way to complete revision control, which could inspire further applications. For example, suppose you roll your own temporal facility on top of SQL and allow branching, as in revision control systems. Even limited branching could make it easy to offer "sandboxing" -- the ability to play with and modify the database with abandon without causing any visible changes to other users. That makes it easy to supply highly realistic user training on a complex database.

Simple branching with a simple merge facility could also simplify some common workflow problems. For example, a non-profit might have volunteers or low-paid workers doing data entry. Giving each worker their own branch could make it easy to allow a supervisor to review their work or enhance it (e.g., de-duplication) before merging it into the main branch where it would become visible to "normal" users. Branches could also simplify permissions. If a user is only granted permission to use/see their unique branch, you don't have to worry about preventing every possible unwanted modification; you'll only merge the changes that make sense anyway.
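One very simplified way to picture the branching idea is an overlay on top of the main data, as in this Python sketch. It is an assumption-laden toy: the `Branch` class and record keys are invented, deletions and merge conflicts are ignored, and a plain dict stands in for the main branch.

```python
class Branch:
    """Hypothetical sandbox: changes stay private until they are merged."""

    def __init__(self, main):
        self.main = main    # shared dict that "normal" users see
        self.changes = {}   # this worker's private edits

    def get(self, key):
        """Read through the branch: private edits shadow the main data."""
        return self.changes.get(key, self.main.get(key))

    def put(self, key, value):
        self.changes[key] = value

    def merge(self, approved_keys=None):
        """Copy approved edits (all edits by default) into the main data."""
        keys = list(self.changes) if approved_keys is None else list(approved_keys)
        for key in keys:
            if key in self.changes:
                self.main[key] = self.changes.pop(key)


main = {"donor-1": "J. Smith"}
volunteer = Branch(main)
volunteer.put("donor-1", "John Smith")
volunteer.put("donor-2", "Jon Smith")  # a duplicate the supervisor will not approve

visible_before_merge = dict(main)      # other users still see the old data
volunteer.merge(approved_keys=["donor-1"])
```

After the merge, only the approved edit is visible in `main`; the unapproved entry remains in the volunteer's branch.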

还不是爱你 2024-07-24 22:40:21

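Besides reading the Wikipedia article? A database that maintains an "audit log" or a similar transaction log will have some "temporal" properties. If you need answers to questions such as who did what to whom and when, then a temporal database is a good fit.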

徒留西风 2024-07-24 22:40:21

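You can imagine a simple temporal database that records your GPS position every few seconds. There is a lot of room to compress this data, whereas a normal database needs to store a timestamp for every row. If you need a lot of throughput, knowing that the data is temporal and that rows never need to be updated or deleted lets the program drop much of the complexity inherent in a typical RDBMS.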
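As a sketch of why per-row timestamps become unnecessary in the GPS example, the Python below stores one start time and one interval for a whole track and derives each timestamp from the row position. The class name, coordinates, start time and five-second interval are all assumptions made for illustration.

```python
from datetime import datetime, timedelta


class FixedIntervalTrack:
    """Append-only track: timestamps are implied by position in the list."""

    def __init__(self, start, interval):
        self.start = start
        self.interval = interval
        self.points = []  # (lat, lon) only, no timestamp per row

    def append(self, lat, lon):
        self.points.append((lat, lon))

    def time_of(self, index):
        """Reconstruct the timestamp of the row at the given index."""
        return self.start + index * self.interval

    def position_at(self, when):
        """Return the reading for the interval that contains `when`."""
        index = (when - self.start) // self.interval
        if not 0 <= index < len(self.points):
            raise LookupError("no reading recorded for that time")
        return self.points[index]


track = FixedIntervalTrack(datetime(2024, 1, 1, 9, 0, 0), timedelta(seconds=5))
track.append(51.5007, -0.1246)
track.append(51.5008, -0.1247)
track.append(51.5010, -0.1249)

print(track.time_of(2))                                   # 2024-01-01 09:00:10
print(track.position_at(datetime(2024, 1, 1, 9, 0, 7)))   # (51.5008, -0.1247)
```

Because rows are only ever appended, there is no need for update or delete handling, which is the simplification the answer points to.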

That said, temporal data is often simply stored in a normal RDBMS. PostgreSQL, for example, has some temporal extensions that make this a little easier.

韶华倾负 2024-07-24 22:40:21

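Two reasons come to mind:

Some are optimized for insert and read-only workloads, and can offer significant performance improvements
Some have a better understanding of time than traditional SQL, allowing operations to be grouped by second, minute, hour, etc.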
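To illustrate the second point, here is a small Python sketch of grouping events by second, minute or hour. It only mimics what a time-aware database would offer as a built-in; the function names, the event timestamps and the year are hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def bucket_start(ts, unit):
    """Truncate a timestamp to the start of its second, minute or hour."""
    if unit == "hour":
        return ts.replace(minute=0, second=0, microsecond=0)
    if unit == "minute":
        return ts.replace(second=0, microsecond=0)
    if unit == "second":
        return ts.replace(microsecond=0)
    raise ValueError(f"unsupported unit: {unit}")


def count_by(events, unit):
    """Count events per time bucket."""
    counts = defaultdict(int)
    for ts in events:
        counts[bucket_start(ts, unit)] += 1
    return dict(counts)


events = [
    datetime(2024, 1, 1, 10, 0, 5),
    datetime(2024, 1, 1, 10, 0, 40),
    datetime(2024, 1, 1, 10, 1, 10),
    datetime(2024, 1, 1, 11, 30, 0),
]

per_minute = count_by(events, "minute")
per_hour = count_by(events, "hour")
print(per_hour[datetime(2024, 1, 1, 10, 0)])  # 3
```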

孤君无依 2024-07-24 22:40:21

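Just an update: temporal database support is coming in SQL Server 2016.

To clear up your doubts about why a temporal database is needed instead of a custom approach, and how efficiently and seamlessly SQL Server sets it up for you, see the in-depth video and demo on Channel9.msdn here: https://channel9.msdn.com/Shows/Data-Expose/Temporal-in-SQL-Server-2016

MSDN link: https://msdn.microsoft.com/en-us/library/dn935015(v=sql.130).aspx

Currently, you can use it in the CTP2 (beta 2) release of SQL Server 2016.

Watch this video to learn how to use temporal tables in SQL Server 2016.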