数据库版本控制计划：热不热？

发布于 2024-10-06 00:31:25 字数 2080 浏览 13 评论 0原文

基于在网络上阅读、堆栈溢出以及大多数这些关于数据库版本控制的文章来自编码恐怖，我尝试编写一个计划来对一个已有 8 年历史的 php mysql 网站的数据库进行版本控制。

Database Version Control plan
- Create a db as the "Master Database"
- Create a table db_version (id, script_name, version_number, author, comment, date_ran)   
- Create baseline script for schema+core data that creates this db from scratch, run this on Master Db
- Create a "test data" script to load any db with working data
- Modifications to the master db are ONLY to be made through the db versioning process
- Ensure everyone developing against the Master Db has a local db created by the baseline script
- Procedures for commiting and updating from the Master Db
    - Master Db Commit
        - Perform a schema diff between your local db and the master db
        - Perform a data diff on core data between your local db and master db
        - If there are changes in either or both cases, combine these changes into an update script
        - Collect the data to be added to a new row in db_version table, and add an insert for this into the script
            - new version number = latest master db version number +1
            - author
            - comment
        - The script must be named as changeScript_V.sql where V is the latest master db version +1
        - Run the script against the master db
        - If the script executed succesfully, add it to the svn repository
        - Add the new db_version record to your local db_version table      
    - Update from Master Db
        - Update your local svn checkout to have all the latest change scripts available
        - compares your local db_version table to the master db_version table to determine which change scripts to run
        - Run the required change scripts in order against your local db, which will also update your local db_version table

我的第一个问题是，这听起来正确吗？
我的第二个问题是，提交过程每天执行多次似乎有点复杂。有没有一种方法可以可靠地实现自动化？或者我不应该经常提交数据库更改以使其变得重要吗？

原文

Based on reading around the web, stack overflow, and mostly these articles about db versioning that were linked from coding horror, I've made a stab at writing a plan for versioning the database of an 8 year old php mysql website.

Database Version Control plan
- Create a db as the "Master Database"
- Create a table db_version (id, script_name, version_number, author, comment, date_ran)   
- Create baseline script for schema+core data that creates this db from scratch, run this on Master Db
- Create a "test data" script to load any db with working data
- Modifications to the master db are ONLY to be made through the db versioning process
- Ensure everyone developing against the Master Db has a local db created by the baseline script
- Procedures for commiting and updating from the Master Db
    - Master Db Commit
        - Perform a schema diff between your local db and the master db
        - Perform a data diff on core data between your local db and master db
        - If there are changes in either or both cases, combine these changes into an update script
        - Collect the data to be added to a new row in db_version table, and add an insert for this into the script
            - new version number = latest master db version number +1
            - author
            - comment
        - The script must be named as changeScript_V.sql where V is the latest master db version +1
        - Run the script against the master db
        - If the script executed succesfully, add it to the svn repository
        - Add the new db_version record to your local db_version table      
    - Update from Master Db
        - Update your local svn checkout to have all the latest change scripts available
        - compares your local db_version table to the master db_version table to determine which change scripts to run
        - Run the required change scripts in order against your local db, which will also update your local db_version table

My first question is, does this sound correct?
My second question is, the commit process seems a bit complicated to do more than once a day. Is there a way to reliably automate it? Or should I not be commiting database changes often enough for it to matter?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

明天过后 2024-10-13 00:31:25

看了你的建议，似乎不太可行，也不切合实际。
我在一家公司工作，每个数据库使用超过 1k 个表（非常复杂的系统），一切都运行良好，如下所示：

有一个人负责数据库（我们称他为 DBPerson） - 每个脚本/数据库更改都有穿过他。这将避免任何不必要的更改，以及一些“忽略”的问题（例如，如果有人移动索引以更好地执行他的查询，您可能会破坏其他人的工作，也许有人会创建一个完全多余且不必要的表）， ETC...）。这将使数据库保持干净和高效。即使这对于一个人（或他的副手）来说似乎是太多的工作，但事实上并非如此 - 数据库通常很少改变。
每个脚本都必须通过 DBPerson 验证。
当脚本被批准时，DBPerson 会分配一个编号并将其放入“更新”文件夹/svn(...) 中，并使用适当的编号（如您所建议的，例如增量编号）。
接下来，如果您有一些持续集成，则会拾取脚本并更新数据库（如果您没有持续集成，请手动执行）。
不要存储整个数据库脚本，所有数据都在脚本中。而是存储实际的数据库。如果您有解决方案的分支 - 让每个分支都有自己的数据库，或者您始终可以为每个分支划分更新脚本，以便您可以回滚/转发到另一个分支。但是，我真的建议为每个分支都有一个单独的数据库。
拥有一个始终包含默认数据（完整）的数据库 - 以满足单元测试、回归测试等的需要。无论何时进行测试，都在该数据库的副本上进行。您甚至可以每晚清理主数据库和测试数据库（当然如果合适的话）。

在这样的环境中，您将拥有多个版本的数据库：

开发人员数据库（本地） - 开发人员用来测试其工作的数据库。他总是可以从大师或测试大师那里复制。
主数据库 - 具有所有默认值的数据库，如果您要重新部署到新客户端，则可能是半空的。
测试主数据库 - 充满测试数据的主数据库。您在 Master 上运行的任何脚本也可以在这里运行。
正在进行的测试数据库 - 从测试主复制并用于测试 - 在任何新测试之前被覆盖。
如果您有分支（类似的数据库，每个客户端略有不同），那么您的每个分支将具有与上面相同的...

您肯定必须对此进行修改以匹配您的情况，但无论如何我认为就可维护性、合并、更新等而言，保留整个数据库的创建脚本的文本版本是错误的......

Looking at your proposals, it doesn't seem like something that's feasible nor practical.
I was working in a company where we used more than 1k tables per database (very complex system), and it all worked fine like this:

Have one person in charge of the DB (lets call him DBPerson) - every script/db change has to pass through him. This will avoid any unnecessary changes, and some 'overlooks' of the issues (for example, if someone moves an index to perform better for his query, hi might destroy other persons work, maybe someone will create a table that is completely redundant and unnecessary, etc...). This will keep db clean and efficient. Even if it seems like this is too much work for one guy (or his deputy), in fact it isn't - the db usually rarely changes.
Each script has to pass validation through DBPerson
When the script is approved, the DBPerson assigns a number and puts it in 'update' folder/svn(...), with appropriate numbering (as you suggested, incremental numbers for example).
Next, if you have some continuous integration in place, the script gets picked up and updates the db (if you don't have continuous integration, do it manually).
Do not store entire database script, with all the data in script. Store the actual database instead. If you have branches of the solution - have each branch with it's own database, or you can always have update scripts divided for each of the branches so you could rollback/forward to another branch. But, I really recommend to have a separate db for each branch.
Have one database always with default data (intact) - for needs of unit tests, regression tests etc. Whenever you do the tests, do them on the copy of this database. You could even put a nightly cleanup of the test databases with the main one (if appropriate of course).

In an environment like this you'll have multiple versions of database:

Developers database (local) - the one that the dev guy is using to test his work. He can always copy from Master or Test Master.
Master database - the one with all the default values, maybe semi-empty if you're doing redeploys to new clients.
Test Master database - Master database filled with test data. Any scripts you have ran on Master you ran here as well.
Test in progress database - copied from Test Master and used for testing - gets overwritten prior to any new test.
If you have branches (similar database with slight difference for each of the clients) than you'll have the same as above for each branch...

You will most certainly have to make modifications of this to match your situation, but anyway I think that keeping the textual version of the create script for entire database is wrong in terms of maintainability, merging, updating, etc...

回复收藏 0 原文

~没有更多了~