Why struct buffer_head is inefficient

Posted 2024-10-29 05:10:52

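Can anyone explain why the struct buffer_head structure used before Linux kernel 2.4 is inefficient, and what is implemented in the struct bio structure used by the new block layer, which was redesigned starting with kernel 2.5.1?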

舞袖。长 2024-11-05 05:10:52

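In kernels prior to 2.5, buffer_head was the unit of I/O. Performing I/O in terms of buffer_head could be inefficient, though. For instance, a single write spanning multiple blocks caused multiple buffer_head objects to be created, wasting both space and time. The buffer_head structure is better suited to providing the mapping from a block to physical memory, which is exactly what it does (and the only thing it does) from the 2.5 kernel onward.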
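To put a number on the "multiple buffer_head objects" point, here is a rough userspace sketch. It is not kernel code: `buffer_heads_needed` and `bios_needed` are hypothetical helpers, and the sketch ignores the kernel's cap on how many segments one bio may carry.

```c
#include <assert.h>
#include <stddef.h>

/* Toy model, not kernel code: how many per-block descriptors
 * (buffer_head style) a transfer of `bytes` bytes needs when every
 * descriptor covers exactly one block of `block_size` bytes. */
static size_t buffer_heads_needed(size_t bytes, size_t block_size)
{
    assert(block_size > 0);
    /* Round up: a partially filled last block still needs a descriptor. */
    return (bytes + block_size - 1) / block_size;
}

/* A bio-style descriptor covers the whole transfer with one object,
 * however many blocks it spans (hypothetical helper for comparison;
 * assumes the transfer fits within a single bio). */
static size_t bios_needed(size_t bytes)
{
    return bytes > 0 ? 1 : 0;
}
```

With 1 KiB blocks, a 16 KiB write needs 16 descriptors in the per-block model, each of which has to be allocated, submitted and completed, against one in the bio-style model.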

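struct bio, on the other hand, was designed to be the unit of I/O in later kernel versions. **It represents an I/O that is currently active in the kernel.** Note that the block-to-memory mapping is still maintained in buffer_head, but buffer_head is no longer overloaded with fields describing currently active I/O. When higher-level code such as a filesystem issues an I/O, a request is added to the request_queue. That request in turn contains pointers to bio structures. The device driver controlling the device the I/O was issued to receives the request and services it.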
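The queue, request and bio relationship described above can be sketched in plain C. This is a simplified userspace model under my own assumptions: every `toy_*` name is hypothetical, and the real kernel structures carry far more state (locking, completion callbacks, merging logic).

```c
#include <stddef.h>

/* Toy stand-ins for the kernel structures; all names are hypothetical. */
struct toy_bio {
    unsigned long   sector;      /* first device sector of this I/O */
    unsigned int    nr_sectors;  /* length of this I/O in sectors */
    struct toy_bio *next;        /* next bio in the same request */
};

struct toy_request {
    struct toy_bio     *bio;      /* first bio of the request */
    struct toy_bio     *biotail;  /* last bio, for O(1) append */
    struct toy_request *next;     /* next request in the queue */
};

struct toy_queue {
    struct toy_request *head;
    struct toy_request *tail;
};

/* Upper layer: attach one more bio to a request. */
static void toy_request_add_bio(struct toy_request *rq, struct toy_bio *bio)
{
    bio->next = NULL;
    if (rq->biotail)
        rq->biotail->next = bio;
    else
        rq->bio = bio;
    rq->biotail = bio;
}

/* Upper layer: put a request at the tail of the queue. */
static void toy_queue_add(struct toy_queue *q, struct toy_request *rq)
{
    rq->next = NULL;
    if (q->tail)
        q->tail->next = rq;
    else
        q->head = rq;
    q->tail = rq;
}

/* Driver side: take requests off the queue in order and "service" each
 * bio. Servicing here only counts sectors; returns the total serviced. */
static unsigned long toy_queue_service(struct toy_queue *q)
{
    unsigned long total = 0;

    while (q->head) {
        struct toy_request *rq = q->head;

        q->head = rq->next;
        for (struct toy_bio *b = rq->bio; b; b = b->next)
            total += b->nr_sectors;
    }
    q->tail = NULL;
    return total;
}
```

The point of the layering is that the filesystem only ever builds bios and hands them over, while the driver only ever sees requests coming off the queue.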

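struct bio is also used to split I/O, for example by RAID disk device drivers. If you look at the struct bio definition in kernels of that era, you will see that it maintains an array of I/O vectors (bio_vec entries, each naming a memory page plus an offset and length within it). The bi_idx field is an index into that array. By using a different bi_idx value for each disk, each disk picks up a different set of I/O vectors to write (or read), so the I/O is done in a truly parallel way.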
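A minimal sketch of the bi_idx idea, assuming a cut-down structure with only the fields needed here. The `toy_*` names are hypothetical; the real struct bio and its clone/split helpers look different and have changed between kernel versions.

```c
#include <stddef.h>

/* One segment: a page (identified here by a plain number) plus the
 * offset and length of the data inside it. Hypothetical stand-in for
 * the kernel's bio_vec. */
struct toy_vec {
    unsigned int page;
    unsigned int offset;
    unsigned int len;
};

/* Only the fields this example needs. */
struct toy_split_bio {
    const struct toy_vec *io_vec; /* segment array, shared between halves */
    unsigned short vcnt;          /* one past the last segment to process */
    unsigned short idx;           /* first segment to process */
};

/* Bytes covered by segments [idx, vcnt). */
static size_t toy_bio_bytes(const struct toy_split_bio *bio)
{
    size_t total = 0;

    for (unsigned short i = bio->idx; i < bio->vcnt; i++)
        total += bio->io_vec[i].len;
    return total;
}

/* Split `parent` at segment index `at`: `first` gets [idx, at) and
 * `second` gets [at, vcnt). No segment data is copied; both halves
 * point at the same array and differ only in their index window.
 * Returns 0 on success, -1 if `at` lies outside the parent's window. */
static int toy_bio_split(const struct toy_split_bio *parent, unsigned short at,
                         struct toy_split_bio *first,
                         struct toy_split_bio *second)
{
    if (at < parent->idx || at > parent->vcnt)
        return -1;
    *first = *parent;
    first->vcnt = at;
    *second = *parent;
    second->idx = at;
    return 0;
}
```

Each half could then be handed to a different disk. Because the halves share one segment array, splitting costs a couple of small structure copies rather than a copy of the data or of the segment list.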

何止钟意 2024-11-05 05:10:52

Use the source, Luke!

From include/linux/buffer_head.h:

/*
 * Historically, a buffer_head was used to map a single block
 * within a page, and of course as the unit of I/O through the
 * filesystem and block layers.  Nowadays the basic I/O unit
 * is the bio, and buffer_heads are used for extracting block
 * mappings (via a get_block_t call), for tracking state within
 * a page (via a page_mapping) and for wrapping bio submission
 * for backward compatibility reasons (e.g. submit_bh).
 */
struct buffer_head {
[ ... ]

And from include/linux/bio.h:

/*
 * main unit of I/O for the block layer and lower layers (ie drivers and
 * stacking drivers)
 */
struct bio {
[ ... ]

The two structures decouple the I/O characteristics and operations of the "higher levels" (I/O through filesystems) from those of the "low levels" (I/O through block devices). This allows for higher performance, because a buffer layer waiting somewhere doesn't stop the block layer from performing work elsewhere. The original answer linked to a further explanation, and to an interview in which Jens Axboe talks about it briefly.

还给你自由 2024-11-05 05:10:52

Robert Love's LKD (Linux Kernel Development), 3rd edition, explains this in great detail.

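The Old Versus the New

The difference between buffer heads and the new bio structure is important. The bio structure represents an I/O operation, which may include one or more pages in memory. The buffer_head structure, on the other hand, represents a single buffer, which describes a single block on the disk. Because buffer heads are tied to a single disk block in a single page, buffer heads result in the unnecessary dividing of requests into block-sized chunks, only to later reassemble them. Because the bio structure is lightweight, it can describe discontiguous blocks and does not unnecessarily split I/O operations.

Switching from struct buffer_head to struct bio provided other benefits as well:

The bio structure can easily represent high memory, because struct bio deals with only physical pages and not direct pointers.

The bio structure can represent both normal page I/O and direct I/O (I/O operations that do not go through the page cache; see Chapter 16, "The Page Cache and Page Writeback," for a discussion of the page cache).

The bio structure makes it easy to perform scatter-gather (vectored) block I/O operations, with the data involved in the operation originating from multiple physical pages.

The bio structure is much more lightweight than a buffer head because it contains only the minimum information needed to represent a block I/O operation and not unnecessary information related to the buffer itself.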
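As a rough illustration of the scatter-gather point, the sketch below gathers data from segments that live in different pages into one contiguous destination, the way a single vectored write would. It is a userspace toy: `sg_seg` and `sg_gather` are hypothetical names, and a page is represented by a plain pointer for simplicity, whereas the passage above notes that the real bio refers to physical pages rather than direct pointers.

```c
#include <stddef.h>
#include <string.h>

/* One segment of a vectored I/O: where the data sits inside a page.
 * Hypothetical type; `page` is a plain pointer only to keep the toy
 * runnable in userspace. */
struct sg_seg {
    const char *page;   /* start of the page holding the data */
    size_t      offset; /* where the data begins within the page */
    size_t      len;    /* number of bytes in this segment */
};

/* Gather: copy every segment, in order, into `dst`.
 * Returns the number of bytes copied, or 0 (copying nothing) if the
 * segments do not fit into `cap` bytes. */
static size_t sg_gather(const struct sg_seg *segs, size_t nsegs,
                        char *dst, size_t cap)
{
    size_t total = 0;
    size_t pos = 0;

    /* First pass: make sure the whole transfer fits. */
    for (size_t i = 0; i < nsegs; i++)
        total += segs[i].len;
    if (total > cap)
        return 0;

    /* Second pass: one transfer, many source pages. */
    for (size_t i = 0; i < nsegs; i++) {
        memcpy(dst + pos, segs[i].page + segs[i].offset, segs[i].len);
        pos += segs[i].len;
    }
    return pos;
}
```

The caller describes the whole operation once, as a list of segments, instead of issuing one request per page and stitching the results together afterwards.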