当前位置：文江博客话题详情

谷歌使用什么数据库？

发布于 2024-07-10 09:18:24 字数 36 浏览 7 评论 0原文

是 Oracle 或 MySQL 还是他们自己构建的东西？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

焚却相思 2024-07-17 09:18:24

Bigtable

结构化数据的分布式存储系统

Bigtable是一个分布式存储
用于管理结构化数据的系统（由 Google 构建）
旨在扩展到非常
大尺寸：跨 PB 级数据
数千个商品服务器。
Google 的许多项目都将数据存储在
Bigtable，包括网络索引，
谷歌地球和谷歌财经。
这些应用非常
对 Bigtable 的不同要求，都在
数据大小方面（从 URL 到 Web
卫星图像页面）和
延迟要求（来自后端
批量处理实时数据
服务）。
尽管有这些不同
要求，Bigtable 已成功
提供了灵活的、高性能的
所有这些 Google 的解决方案
产品。

一些特性

快速且超大规模的 DBMS
是一种稀疏的、分布式的多维排序映射，具有面向行和面向列数据库的共同特征。
旨在扩展到 PB 范围，
它可以在数百或数千台机器上运行，
可以轻松地向系统添加更多机器，并自动开始利用这些资源，无需任何重新配置。
每个表都有多个维度（其中一个是时间字段），允许版本控制）
表针对 GFS（Google 文件系统）进行了优化，方法是拆分为多个数据块 - 表的各个部分沿着所选行拆分，使得数据块的大小约为 200 兆字节。

架构

BigTable 不是关系数据库。它不支持连接，也不支持丰富的类似 SQL 的查询。每个表都是一个多维稀疏映射。表格由行和列组成，每个单元格都有一个时间戳。一个单元可以有多个版本，具有不同的时间戳。时间戳允许执行诸如“选择此网页的'n'个版本”或“删除早于特定日期/时间的单元格”之类的操作。

为了管理巨大的表，Bigtable 在行边界处分割表并将它们保存为片剂。一个tablet大约200MB，每台机器节省100个tablet左右。此设置允许单个表中的平板电脑分布在许多服务器上。它还允许细粒度的负载平衡。如果一个表正在接收许多查询，它可以摆脱其他平板电脑或将繁忙的表移动到另一台不那么繁忙的机器上。此外，如果一台机器出现故障，平板电脑可能会分布在许多其他服务器上，以便对任何给定机器的性能影响最小。

表存储为不可变的 SSTable 和日志尾部（每台机器一个日志）。当机器耗尽系统内存时，它会使用 Google 专有的压缩技术（BMDiff 和 Zippy）来压缩一些平板电脑。小压缩只涉及几个tablet，而大压缩则涉及整个表系统并回收硬盘空间。

Bigtable 片剂的位置存储在单元格中。任何特定平板电脑的查找均由三层系统处理。客户端获得一个指向 META0 表的点，该表只有一个。 META0 表跟踪许多 META1 片剂，其中包含正在查找的片剂的位置。 META0 和 META1 都大量使用预取和缓存来最大限度地减少系统瓶颈。

实现

BigTable 构建于 Google 文件系统 (GFS) 之上，用作日志和数据文件的后备存储。 GFS 为 SSTables 提供可靠的存储，SSTables 是一种用于保存表数据的 Google 专有文件格式。

BigTable 大量使用的另一个服务是 Chubby，这是一种高可用、可靠的分布式锁服务。 Chubby 允许客户端获取锁，可能将其与一些元数据相关联，它可以通过将保持活动消息发送回 Chubby 来更新元数据。锁存储在类似文件系统的分层命名结构中。

Bigtable 系统中存在三种主要服务器类型：

主服务器：将平板电脑分配给平板电脑服务器，跟踪平板电脑的位置并根据需要重新分配任务。
平板电脑服务器：当平板电脑和拆分平板电脑超过大小限制（通常为 100MB - 200MB）时，处理平板电脑和拆分平板电脑的读/写请求。如果一个tablet服务器出现故障，那么100个tablet服务器每台都会拾取1个新的tablet，系统就会恢复。
锁服务器：Chubby 分布式锁服务的实例。 BigTable 中的许多操作都需要获取锁，包括打开 Tablet 进行写入、确保一次不超过一个活动 Master 以及访问控制检查。

Google 研究论文的示例：

alt text

示例表的一部分
存储网页。行名称是
反向网址。内容栏
family 包含页面内容，并且
锚柱族包含
引用的任何锚点的文本
页。 CNN 的主页引用自
无论是《体育画报》还是《体育画报》
MY-看主页，所以排
包含名为
锚点：cnnsi.com 和
锚点：my.look.ca。每个锚细胞
有一个版本；内容栏
有三个版本（按时间戳）
t3、t5 和 t6。

API

BigTable 的典型操作是创建和删除表和列族、写入数据以及从行中删除列。 BigTable 通过 API 向应用程序开发人员提供此功能。事务在行级别受支持，但不支持跨多个行键。

以下是研究论文 PDF 的链接。

在这里您可以找到视频，其中展示了 Google 的 Jeff Dean在华盛顿大学的一次演讲中，讨论了 Google 后端使用的 Bigtable 内容存储系统。

Bigtable

A Distributed Storage System for Structured Data

Bigtable is a distributed storage
system (built by Google) for managing structured data
that is designed to scale to a very
large size: petabytes of data across
thousands of commodity servers.
Many projects at Google store data in
Bigtable, including web indexing,
Google Earth, and Google Finance.
These applications place very
different demands on Bigtable, both in
terms of data size (from URLs to web
pages to satellite imagery) and
latency requirements (from backend
bulk processing to real-time data
serving).
Despite these varied
demands, Bigtable has successfully
provided a flexible, high-performance
solution for all of these Google
products.

Some features

fast and extremely large-scale DBMS
a sparse, distributed multi-dimensional sorted map, sharing characteristics of both row-oriented and column-oriented databases.
designed to scale into the petabyte range
it works across hundreds or thousands of machines
it is easy to add more machines to the system and automatically start taking advantage of those resources without any reconfiguration
each table has multiple dimensions (one of which is a field for time, allowing versioning)
tables are optimized for GFS (Google File System) by being split into multiple tablets - segments of the table as split along a row chosen such that the tablet will be ~200 megabytes in size.

Architecture

BigTable is not a relational database. It does not support joins nor does it support rich SQL-like queries. Each table is a multidimensional sparse map. Tables consist of rows and columns, and each cell has a time stamp. There can be multiple versions of a cell with different time stamps. The time stamp allows for operations such as "select 'n' versions of this Web page" or "delete cells that are older than a specific date/time."

In order to manage the huge tables, Bigtable splits tables at row boundaries and saves them as tablets. A tablet is around 200 MB, and each machine saves about 100 tablets. This setup allows tablets from a single table to be spread among many servers. It also allows for fine-grained load balancing. If one table is receiving many queries, it can shed other tablets or move the busy table to another machine that is not so busy. Also, if a machine goes down, a tablet may be spread across many other servers so that the performance impact on any given machine is minimal.

Tables are stored as immutable SSTables and a tail of logs (one log per machine). When a machine runs out of system memory, it compresses some tablets using Google proprietary compression techniques (BMDiff and Zippy). Minor compactions involve only a few tablets, while major compactions involve the whole table system and recover hard-disk space.

The locations of Bigtable tablets are stored in cells. The lookup of any particular tablet is handled by a three-tiered system. The clients get a point to a META0 table, of which there is only one. The META0 table keeps track of many META1 tablets that contain the locations of the tablets being looked up. Both META0 and META1 make heavy use of pre-fetching and caching to minimize bottlenecks in the system.

Implementation

BigTable is built on Google File System (GFS), which is used as a backing store for log and data files. GFS provides reliable storage for SSTables, a Google-proprietary file format used to persist table data.

Another service that BigTable makes heavy use of is Chubby, a highly-available, reliable distributed lock service. Chubby allows clients to take a lock, possibly associating it with some metadata, which it can renew by sending keep alive messages back to Chubby. The locks are stored in a filesystem-like hierarchical naming structure.

There are three primary server types of interest in the Bigtable system:

Master servers: assign tablets to tablet servers, keeps track of where tablets are located and redistributes tasks as needed.
Tablet servers: handle read/write requests for tablets and split tablets when they exceed size limits (usually 100MB - 200MB). If a tablet server fails, then a 100 tablet servers each pickup 1 new tablet and the system recovers.
Lock servers: instances of the Chubby distributed lock service. Lots of actions within BigTable require acquisition of locks including opening tablets for writing, ensuring that there is no more than one active Master at a time, and access control checking.

Example from Google's research paper:

alt text

A slice of an example table that
stores Web pages. The row name is a
reversed URL. The contents column
family contains the page contents, and
the anchor column family contains the
text of any anchors that reference the
page. CNN's home page is referenced by
both the Sports Illustrated and the
MY-look home pages, so the row
contains columns named
anchor:cnnsi.com and
anchor:my.look.ca. Each anchor cell
has one version; the contents column
has three versions, at timestamps
t3, t5, and t6.

API

Typical operations to BigTable are creation and deletion of tables and column families, writing data and deleting columns from a row. BigTable provides this functions to application developers in an API. Transactions are supported at the row level, but not across several row keys.

Here is the link to the PDF of the research paper.

And here you can find a video showing Google's Jeff Dean in a lecture at the University of Washington, discussing the Bigtable content storage system used in Google's backend.

回复收藏 0 原文

⒈起吃苦の倖褔 2024-07-17 09:18:24

这是他们自己构建的东西 - 称为 Bigtable。

http://en.wikipedia.org/wiki/BigTable

Google 发表了一篇论文，介绍了数据库：

http://research.google.com/archive/bigtable.html

回复收藏 0 原文

蓝天 2024-07-17 09:18:24

Spanner 是 Google 的全球分布式关系数据库管理系统 (RDBMS)，是 < a href="http://en.wikipedia.org/wiki/BigTable" rel="noreferrer">BigTable。谷歌声称它不是一个纯粹的关系系统，因为每个表都必须有一个主键。

此处是该论文的链接。

Spanner 是 Google 的可扩展、多版本、全球分布的、
同步复制数据库。这是第一个系统
在全球范围内分发数据并支持外部一致
分布式事务。本文描述了 Spanner 是如何
结构化、其功能集、各种设计背后的基本原理
决策，以及暴露时钟不确定性的新颖时间 API。这
API及其实现对于支持外部至关重要
一致性和各种强大的功能：非阻塞读入
过去，无锁只读事务和原子模式更改，
横跨整个 Spanner。

Google 发明的另一个数据库是 Megastore。这是摘要：

Megastore是为了满足以下要求而开发的存储系统
今天的交互式在线服务。 Megastore 融合了可扩展性
NoSQL 数据存储的优点与传统 RDBMS 的便利性
新颖的方式，并提供强一致性保证和高
可用性。我们在内部提供完全可序列化的 ACID 语义
细粒度的数据分区。这种划分使我们能够
通过广域网同步复制每个写入
合理的延迟并支持数据中心之间的无缝故障转移。
本文描述了Megastore的语义和复制算法。
它还描述了我们支持广泛的 Google 的经验
使用 Megastore 构建的生产服务。

回复收藏 0 原文

爱的十字路口 2024-07-17 09:18:24

正如其他人提到的，谷歌使用了一种名为 BigTable 的本土解决方案，并且他们已经发布了几篇论文，将其描述到现实世界中。

Apache 人员实现了这些论文中提出的想法，称为 HBase。 HBase 是更大的 Hadoop 项目的一部分，根据他们的网站，该项目“是一个软件平台，可以让人们轻松编写和运行处理大量数据的应用程序。”一些基准测试非常令人印象深刻。他们的网站位于 http://hadoop.apache.org。

回复收藏 0 原文

一身骄傲 2024-07-17 09:18:24

尽管 Google 的所有主要应用程序都使用 BigTable，但他们也使用 MySQL其他（可能是次要的）应用程序。

回复收藏 0 原文

月光色 2024-07-17 09:18:24

而且知道 BigTable 不是关系数据库（如 MySQL）而是一个巨大的（分布式）散列也许也很方便表具有非常不同的特征。您可以在 Google AppEngine 平台上自行试用 BigTable（有限版本）。

除了上面提到的 Hadoop 之外，还有许多其他实现尝试解决与 BigTable 相同的问题（可扩展性、可用性）。我昨天看到一篇不错的博客文章，列出了其中的大多数这里。

回复收藏 0 原文

拿命拼未来 2024-07-17 09:18:24

Google 主要使用 Bigtable。

Bigtable 是一个用于管理结构化数据的分布式存储系统，旨在扩展到非常大的规模。

有关详细信息，请从此处下载该文档。

Google 还在其某些应用程序中使用 Oracle 和 MySQL 数据库。

如果您能添加更多信息，我们将不胜感激。

回复收藏 0 原文

雨巷深深 2024-07-17 09:18:24

Google 服务具有多语言持久性架构。 BigTable 被 YouTube、Google 搜索、Google Analytics 等大多数服务所利用。该搜索服务最初使用 MapReduce 作为其索引基础设施，但后来在 Caffeine 发布期间过渡到 BigTable。

Google Cloud 数据存储在 Google 生产环境中拥有 100 多个面向内部和外部用户的应用程序。 Gmail、Picasa、Google 日历、Android Market 等应用程序 AppEngine 使用 Cloud Datastore & 大型商店。

Google Trends 使用 MillWheel 进行流处理。 Google Ads 最初使用 MySQL，后来迁移到 F1 DB - 一种自定义编写的分布式关系数据库。 YouTube 使用 MySQL 和 Vitess。 Google 在 Google 文件系统的帮助下在商品服务器上存储了 EB 级的数据。

资料来源：Google 数据库： Google 服务如何存储 PB 至 EB 级数据？

YouTube 数据库 – 它如何在不耗尽存储空间的情况下存储如此多的视频？