[PATCH v3 00/14] ext4: replace ext4_es_insert_extent() when caching on-disk extents

Zhang Yi posted 14 patches 2 days, 11 hours ago
fs/ext4/extents.c        | 135 ++++++++++++++++++++++++---------------
fs/ext4/extents_status.c | 124 ++++++++++++++++++++++++++---------
fs/ext4/inode.c          |  18 +++---
3 files changed, 187 insertions(+), 90 deletions(-)
[PATCH v3 00/14] ext4: replace ext4_es_insert_extent() when caching on-disk extents
Posted by Zhang Yi 2 days, 11 hours ago
From: Zhang Yi <yi.zhang@huawei.com>

Changes since v2:
 - Rebase the codes on ext4.git dev-91ef18b567da.
 - Move the first cleanup patch in v2 to patch 08 to facilitate easier
   backporting.
 - In patch 01, correct the mismatch comments for
   EXT4_EXT_DATA_ENTIRE_VALID1 and EXT4_EXT_DATA_PARTIAL_VALID1.
 - Modify patch 06 and add 07, cleanup the commit message to avoid
   confusion, and don't always drop extent cache before splitting
   extent, instead, do this only after PARTIAL_VALID1 zeroed out or
   split extent fails.
 - In patch 08, mark zero_ex to initialized.
 - In patch 09, correct the word 'tag' to 'lable' in the commit message.
 - In patch 11, add return value check of __es_remove_extent() in
   ext4_es_cache_extent().
 - Collecting RVB tags.

   Thanks for the comments and suggestions from Jan, Ojaswin and Baokun!
   Next, it is necessary to focus on refactoring and cleaning up the
   code related to ext4_split_extent(). Ojaswin is going to take on this
   work since he has already been exploring it on his local branch.

Changes since v1:
 - Rebase the codes based on the latest linux-next 20251120.
 - Add patches 01-05, fix two stale data problems caused by
   EXT4_EXT_MAY_ZEROOUT when splitting extent.
 - Add patches 06-07, fix two stale extent status entries problems also
   caused by splitting extent.
 - Modify patches 08-10, extend __es_remove_extent() and
   ext4_es_cache_extent() to allow them to overwrite existing extents of
   the same status when caching on-disk extents, while also checking
   extents of different stauts and raising alarms to prevent misuse.
 - Add patch 13 to clear the usage of ext4_es_insert_extent(), and
   remove the TODO comment in it.

v2: https://lore.kernel.org/linux-ext4/20251121060811.1685783-1-yi.zhang@huaweicloud.com/
v1: https://lore.kernel.org/linux-ext4/20251031062905.4135909-1-yi.zhang@huaweicloud.com/

Original Description

This series addresses the optimization that Jan pointed out [1]
regarding the introduction of a sequence number to
ext4_es_insert_extent(). The proposal is to replace all instances where
the cache of on-disk extents is updated by using ext4_es_cache_extent()
instead of ext4_es_insert_extent(). This change can prevent excessive
cache invalidations caused by unnecessarily increasing the extent
sequence number when reading from the on-disk extent tree.

[1] https://lore.kernel.org/linux-ext4/ympvfypw3222g2k4xzd5pba4zhkz5jihw4td67iixvrqhuu43y@wse63ntv4s6u/

Cheers,
Yi.

Zhang Yi (14):
  ext4: subdivide EXT4_EXT_DATA_VALID1
  ext4: don't zero the entire extent if EXT4_EXT_DATA_PARTIAL_VALID1
  ext4: don't set EXT4_GET_BLOCKS_CONVERT when splitting before
    submitting I/O
  ext4: correct the mapping status if the extent has been zeroed
  ext4: don't cache extent during splitting extent
  ext4: drop extent cache after doing PARTIAL_VALID1 zeroout
  ext4: drop extent cache when splitting extent fails
  ext4: cleanup zeroout in ext4_split_extent_at()
  ext4: cleanup useless out label in __es_remove_extent()
  ext4: make __es_remove_extent() check extent status
  ext4: make ext4_es_cache_extent() support overwrite existing extents
  ext4: adjust the debug info in ext4_es_cache_extent()
  ext4: replace ext4_es_insert_extent() when caching on-disk extents
  ext4: drop the TODO comment in ext4_es_insert_extent()

 fs/ext4/extents.c        | 135 ++++++++++++++++++++++++---------------
 fs/ext4/extents_status.c | 124 ++++++++++++++++++++++++++---------
 fs/ext4/inode.c          |  18 +++---
 3 files changed, 187 insertions(+), 90 deletions(-)

-- 
2.46.1
Re: [PATCH v3 00/14] ext4: replace ext4_es_insert_extent() when caching on-disk extents
Posted by Theodore Ts'o 5 hours ago
On Sat, 29 Nov 2025 18:32:32 +0800, Zhang Yi wrote:
> Changes since v2:
>  - Rebase the codes on ext4.git dev-91ef18b567da.
>  - Move the first cleanup patch in v2 to patch 08 to facilitate easier
>    backporting.
>  - In patch 01, correct the mismatch comments for
>    EXT4_EXT_DATA_ENTIRE_VALID1 and EXT4_EXT_DATA_PARTIAL_VALID1.
>  - Modify patch 06 and add 07, cleanup the commit message to avoid
>    confusion, and don't always drop extent cache before splitting
>    extent, instead, do this only after PARTIAL_VALID1 zeroed out or
>    split extent fails.
>  - In patch 08, mark zero_ex to initialized.
>  - In patch 09, correct the word 'tag' to 'lable' in the commit message.
>  - In patch 11, add return value check of __es_remove_extent() in
>    ext4_es_cache_extent().
>  - Collecting RVB tags.
> 
> [...]

Applied, thanks!

[01/14] ext4: subdivide EXT4_EXT_DATA_VALID1
        commit: 0f9885eab9182118fd7bfd8cdf8bab6f71f74699
[02/14] ext4: don't zero the entire extent if EXT4_EXT_DATA_PARTIAL_VALID1
        commit: 1fec988b1f71c27c45d31cde6ffe3efdb10657b9
[03/14] ext4: don't set EXT4_GET_BLOCKS_CONVERT when splitting before submitting I/O
        commit: c42e9f199c419f11938b8d411123e3f6719941d4
[04/14] ext4: correct the mapping status if the extent has been zeroed
        commit: 2410e55561cc405c56b9e38d69be1b8fdb6c9722
[05/14] ext4: don't cache extent during splitting extent
        commit: 4b4a6ac831ff347127e46c60a516b3ec42921242
[06/14] ext4: drop extent cache after doing PARTIAL_VALID1 zeroout
        commit: 87d5cb059b8ab1623f5bcebcc0b53e43abd36ae7
[07/14] ext4: drop extent cache when splitting extent fails
        commit: 889085343ddffdf9ccb6be8402469458da6b350f
[08/14] ext4: cleanup zeroout in ext4_split_extent_at()
        commit: 02f8dc1707ceb87656288e6460f3ebb94200ba2c
[09/14] ext4: cleanup useless out label in __es_remove_extent()
        commit: 13cbc168d9ba14822de66fc085e85416cc2fda8e
[10/14] ext4: make __es_remove_extent() check extent status
        commit: ad02a3d000a512aada99cfad13d62c3edfb793de
[11/14] ext4: make ext4_es_cache_extent() support overwrite existing extents
        commit: 41a414d53bfb5c91ea5c73125181568901c74a7a
[12/14] ext4: adjust the debug info in ext4_es_cache_extent()
        commit: 4e84970a460d27f35f3127327c3e131476c06b03
[13/14] ext4: replace ext4_es_insert_extent() when caching on-disk extents
        commit: d494567091eddfeded77017bb9b4dc677046d93d
[14/14] ext4: drop the TODO comment in ext4_es_insert_extent()
        commit: 6fb67ac896900e60f46ee4efba97b372a80370e0

Best regards,
-- 
Theodore Ts'o <tytso@mit.edu>
Re: [PATCH v3 00/14] ext4: replace ext4_es_insert_extent() when caching on-disk extents
Posted by Theodore Tso 5 hours ago
On Mon, Dec 01, 2025 at 11:23:50AM -0500, Theodore Ts'o wrote:
> Applied, thanks!

n.b.  This is on the dev branch, but I plan to not include it in the initial
pull request to Linus, so it can get a bit more soak testing.  I'll
send to Linus after -rc1.

					- Ted