当前位置：文江博客话题详情

练习示例数据库

发布于 2024-07-17 08:54:46 字数 1542 浏览 10 评论 0原文

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

习ぎ惯性依靠 2024-07-24 08:54:46

您可以尝试经典的 MySQL world 数据库。

world.sql 文件可在此处下载：

http://dev.mysql.com/ doc/index-other.html

只需向下滚动到示例数据库，您就会找到它。

回复收藏 0 原文

请止步禁区 2024-07-24 08:54:46

这是一个在线数据库，但您可以尝试使用 stackoverflow 数据库：
https://data.stackexchange.com/stackoverflow/query/new

您还可以在此处下载其转储：

https://archive.org/download/stackexchange

回复收藏 0 原文

逆光飞翔i 2024-07-24 08:54:46

查看 CodePlex 的 Microsoft SQL Server 社区项目和示例

第 3 方编辑

在上面的链接顶部，您可以查看

microsoft sql server 示例github 上
codeplex 上的 msft db 产品示例
新的 Wide World Importers 示例数据库包括 OLTP 和适用于 sql server 2016 及更高版本的 OLAP

回复收藏 0 原文

拿命拼未来 2024-07-24 08:54:46

为什么不下载英文维基百科？有各种大小的压缩 SQL 文件，它对您来说肯定足够大

主要文章是 XML，因此将它们插入数据库有点问题，但您可能会发现那里还有其他适合您的文件。例如，页间链接SQL文件压缩后为2.3GB。看看https://en.wikipedia.org/wiki/Wikipedia:Database_download 了解更多信息。

奥斯卡

回复收藏 0 原文

与酒说心事 2024-07-24 08:54:46

这就是我用来学习sql的： employees-db

这是一个带有集成测试套件的示例数据库，用于测试
您的应用程序和数据库服务器

第 3 方编辑

根据 launchpad.net，数据库已移至 github。

该数据库包含约 30 万条员工记录，其中 280 万条
工资条目。导出数据为167MB，不算大，但是
足够重，对于测试而言并非易事。
数据已生成，因此存在不一致和
微妙的问题。我们没有删除它们，而是决定留下
内容保持不变，并将这些问题用作数据清理练习。

回复收藏 0 原文

尹雨沫 2024-07-24 08:54:46

如果您想要使用真实数据的大型数据库，您可以报名参加 Netflix 有奖竞赛并获得访问权限他们的数据相当大（几GB的条目）。

第 3 方编辑

上面的 URL 不再包含数据集（2016 年 10 月）。有关 Netflix 奖的维基百科页面报道称，有关隐私问题的诉讼已达成和解。

回复收藏 0 原文

白日梦 2024-07-24 08:54:46

你想要巨大的吗？

这是一个小表： create table foo (id int not null Primary key auto_increment, crap char(2000));

插入 foo(crap) 值 ('');

-- 每次运行下一行时， foo 中的行数都会加倍。
插入 foo( crap ) select * from foo;

再运行二十次，您就有超过一百万行可供使用。

是的，如果他正在寻找关系的外表来导航，这不是答案。但如果他想测试性能和优化能力，那么这就可以了。我正是这样做的（然后用随机值更新）来测试我对另一个问题的潜在答案。（并且没有回答它，因为我无法想出比提问者更好的性能。）

如果他问“复杂”，我会给出不同的答案。对我来说，“巨大”意味着“很多行”。

因为你不需要很大的东西来玩表和关系。考虑一个表本身，没有可为空的列。可以有多少种不同种类的行？只有一列，因为所有列都必须有某个值，因为没有列可以为空。

每个可为空的列都会乘以两倍的不同类型可能的行数：该列为空的行，该列不为空的行。

现在，不要孤立地考虑该表。考虑一个作为子表的表：对于与父表具有 FK 的每个子表，即多对一，可以有 0 个、1 个或多个子表。因此，我们将上一步中得到的计数乘以三倍（零行表示零，正好一行表示一行，多行表示两行）。对于任何祖父母来说，父母是许多人，另外三个。

对于多对多关系，我们可以没有关系、一对一、一对多、多对一或多对多。因此，对于我们可以从表中的图表中获得的每个多对多，我们将行乘以九——或者就像两个一对多一样。如果多对多也有数据，我们乘以可为空数。

我们在图表中无法到达的表——那些我们没有直接或间接 FK 的表，不要将表中的行相乘。

通过递归地乘以我们可以到达的每个表，我们可以得出提供每种“种类”所需的行数，并且我们只需要这些来测试模式中的每种可能的关系。而且我们还远没有达到巨大的程度。

You want huge?

Here's a small table: create table foo (id int not null primary key auto_increment, crap char(2000));

insert into foo(crap) values ('');

-- each time you run the next line, the number of rows in foo doubles.
insert into foo( crap ) select * from foo;

run it twenty more times, you have over a million rows to play with.

Yes, if he's looking for looks of relations to navigate, this is not the answer. But if by huge he means to test performance and his ability to optimize, this will do it. I did exactly this (and then updated with random values) to test an potential answer I had for another question. (And didn't answer it, because I couldn't come up with better performance than what that asker had.)

Had he asked for "complex", I'd have gien a differnt answer. To me,"huge" implies "lots of rows".

Because you don't need huge to play with tables and relations. Consider a table, by itself, with no nullable columns. How many different kinds of rows can there be? Only one, as all columns must have some value as none can be null.

Every nullable column multiples by two the number of different kinds of rows possible: a row where that column is null, an row where it isn't null.

Now consider the table, not in isolation. Consider a table that is a child table: for every child that has an FK to the parent, that, is a many-to-one, there can be 0, 1 or many children. So we multiply by three times the count we got in the previous step (no rows for zero, one for exactly one, two rows for many). For any grandparent to which the parent is a many, another three.

For many-to-many relations, we can have have no relation, a one-to-one, a one-to-many, many-to-one, or a many-to-many. So for each many-to-many we can reach in a graph from the table, we multiply the rows by nine -- or just like two one-to manys. If the many-to-many also has data, we multiply by the nullability number.

Tables that we can't reach in our graph -- those that we have no direct or indirect FK to, don't multiply the rows in our table.

By recursively multiplying the each table we can reach, we can come up with the number of rows needed to provide one of each "kind", and we need no more than those to test every possible relation in our schema. And we're nowhere near huge.

回复收藏 0 原文

~没有更多了~

关于作者

匿名的好友

暂无简介

0 文章

0 评论

24 人气

关注发私信

友情链接

文江博客

练习示例数据库

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（7）

第 3 方编辑

3rd party edit

第 3 方编辑

3rd party edit

第 3 方编辑

3rd party edit

关于作者

相关话题

热门标签

推荐作者

苍风燃霜

我的黑色迷你裙

悸初

撧情箌佬

森罗

lyn1245

友情链接

练习示例数据库

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（7）

第 3 方编辑

3rd party edit

第 3 方编辑

3rd party edit

第 3 方编辑

3rd party edit

关于作者

相关话题

热门标签

推荐作者

苍风燃霜

我的黑色迷你裙

悸初

撧情箌佬

森罗

lyn1245

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。