From nobody Wed Dec 17 10:44:53 2025
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 01/15] mm/filemap: change filemap_create_folio() to take a struct kiocb
Date: Sun, 10 Nov 2024 08:27:53 -0700
Message-ID: <20241110152906.1747545-2-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

Rather than pass in both the file and position directly from the kiocb,
just take a struct kiocb instead. While doing so, move the ki_flags
checking into filemap_create_folio() as well. In preparation for actually
needing the kiocb in the function.

No functional changes in this patch.

Signed-off-by: Jens Axboe
---
 mm/filemap.c | 17 +++++++++--------
 1 file changed, 9 insertions(+), 8 deletions(-)

diff --git a/mm/filemap.c b/mm/filemap.c
index 36d22968be9a..0b187938b999 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -2460,15 +2460,17 @@ static int filemap_update_page(struct kiocb *iocb,
 	return error;
 }
 
-static int filemap_create_folio(struct file *file,
-		struct address_space *mapping, loff_t pos,
-		struct folio_batch *fbatch)
+static int filemap_create_folio(struct kiocb *iocb,
+		struct address_space *mapping, struct folio_batch *fbatch)
 {
 	struct folio *folio;
 	int error;
 	unsigned int min_order = mapping_min_folio_order(mapping);
 	pgoff_t index;
 
+	if (iocb->ki_flags & (IOCB_NOWAIT | IOCB_WAITQ))
+		return -EAGAIN;
+
 	folio = filemap_alloc_folio(mapping_gfp_mask(mapping), min_order);
 	if (!folio)
 		return -ENOMEM;
@@ -2487,7 +2489,7 @@ static int filemap_create_folio(struct file *file,
 	 * well to keep locking rules simple.
 	 */
 	filemap_invalidate_lock_shared(mapping);
-	index = (pos >> (PAGE_SHIFT + min_order)) << min_order;
+	index = (iocb->ki_pos >> (PAGE_SHIFT + min_order)) << min_order;
 	error = filemap_add_folio(mapping, folio, index,
 			mapping_gfp_constraint(mapping, GFP_KERNEL));
 	if (error == -EEXIST)
@@ -2495,7 +2497,8 @@ static int filemap_create_folio(struct file *file,
 	if (error)
 		goto error;
 
-	error = filemap_read_folio(file, mapping->a_ops->read_folio, folio);
+	error = filemap_read_folio(iocb->ki_filp, mapping->a_ops->read_folio,
+					folio);
 	if (error)
 		goto error;
 
@@ -2551,9 +2554,7 @@ static int filemap_get_pages(struct kiocb *iocb, size_t count,
 		filemap_get_read_batch(mapping, index, last_index - 1, fbatch);
 	}
 	if (!folio_batch_count(fbatch)) {
-		if (iocb->ki_flags & (IOCB_NOWAIT | IOCB_WAITQ))
-			return -EAGAIN;
-		err = filemap_create_folio(filp, mapping, iocb->ki_pos, fbatch);
+		err = filemap_create_folio(iocb, mapping, fbatch);
 		if (err == AOP_TRUNCATED_PAGE)
 			goto retry;
 		return err;
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 02/15] mm/readahead: add folio allocation helper
Date: Sun, 10 Nov 2024 08:27:54 -0700
Message-ID: <20241110152906.1747545-3-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

Just a wrapper around filemap_alloc_folio() for now, but add it in
preparation for modifying the folio based on the 'ractl' being passed
in.

No functional changes in this patch.

Signed-off-by: Jens Axboe
---
 mm/readahead.c | 16 +++++++++++-----
 1 file changed, 11 insertions(+), 5 deletions(-)

diff --git a/mm/readahead.c b/mm/readahead.c
index 3dc6c7a128dd..003cfe79880d 100644
--- a/mm/readahead.c
+++ b/mm/readahead.c
@@ -188,6 +188,12 @@ static void read_pages(struct readahead_control *rac)
 	BUG_ON(readahead_count(rac));
 }
 
+static struct folio *ractl_alloc_folio(struct readahead_control *ractl,
+				       gfp_t gfp_mask, unsigned int order)
+{
+	return filemap_alloc_folio(gfp_mask, order);
+}
+
 /**
  * page_cache_ra_unbounded - Start unchecked readahead.
  * @ractl: Readahead control.
@@ -260,8 +266,8 @@ void page_cache_ra_unbounded(struct readahead_control *ractl,
 			continue;
 		}
 
-		folio = filemap_alloc_folio(gfp_mask,
-					    mapping_min_folio_order(mapping));
+		folio = ractl_alloc_folio(ractl, gfp_mask,
+					mapping_min_folio_order(mapping));
 		if (!folio)
 			break;
 
@@ -431,7 +437,7 @@ static inline int ra_alloc_folio(struct readahead_control *ractl, pgoff_t index,
 		pgoff_t mark, unsigned int order, gfp_t gfp)
 {
 	int err;
-	struct folio *folio = filemap_alloc_folio(gfp, order);
+	struct folio *folio = ractl_alloc_folio(ractl, gfp, order);
 
 	if (!folio)
 		return -ENOMEM;
@@ -753,7 +759,7 @@ void readahead_expand(struct readahead_control *ractl,
 		if (folio && !xa_is_value(folio))
 			return; /* Folio apparently present */
 
-		folio = filemap_alloc_folio(gfp_mask, min_order);
+		folio = ractl_alloc_folio(ractl, gfp_mask, min_order);
 		if (!folio)
 			return;
 
@@ -782,7 +788,7 @@ void readahead_expand(struct readahead_control *ractl,
 		if (folio && !xa_is_value(folio))
 			return; /* Folio apparently present */
 
-		folio = filemap_alloc_folio(gfp_mask, min_order);
+		folio = ractl_alloc_folio(ractl, gfp_mask, min_order);
 		if (!folio)
 			return;
 
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 03/15] mm: add PG_uncached page flag
Date: Sun, 10 Nov 2024 08:27:55 -0700
Message-ID: <20241110152906.1747545-4-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

Add a page flag that file IO can use to indicate that the IO being done
is uncached, as in it should not persist in the page cache after the IO
has been completed.

Signed-off-by: Jens Axboe
---
 include/linux/page-flags.h     | 5 +++++
 include/trace/events/mmflags.h | 3 ++-
 2 files changed, 7 insertions(+), 1 deletion(-)

diff --git a/include/linux/page-flags.h b/include/linux/page-flags.h
index cc839e4365c1..3c4003495929 100644
--- a/include/linux/page-flags.h
+++ b/include/linux/page-flags.h
@@ -110,6 +110,7 @@ enum pageflags {
 	PG_reclaim,		/* To be reclaimed asap */
 	PG_swapbacked,		/* Page is backed by RAM/swap */
 	PG_unevictable,		/* Page is "unevictable"  */
+	PG_uncached,		/* uncached read/write IO */
 #ifdef CONFIG_MMU
 	PG_mlocked,		/* Page is vma mlocked */
 #endif
@@ -562,6 +563,10 @@ PAGEFLAG(Reclaim, reclaim, PF_NO_TAIL)
 FOLIO_FLAG(readahead, FOLIO_HEAD_PAGE)
 	FOLIO_TEST_CLEAR_FLAG(readahead, FOLIO_HEAD_PAGE)
 
+FOLIO_FLAG(uncached, FOLIO_HEAD_PAGE)
+	FOLIO_TEST_CLEAR_FLAG(uncached, FOLIO_HEAD_PAGE)
+	__FOLIO_SET_FLAG(uncached, FOLIO_HEAD_PAGE)
+
 #ifdef CONFIG_HIGHMEM
 /*
  * Must use a macro here due to header dependency issues. page_zone() is not
diff --git a/include/trace/events/mmflags.h b/include/trace/events/mmflags.h
index bb8a59c6caa2..b60057284102 100644
--- a/include/trace/events/mmflags.h
+++ b/include/trace/events/mmflags.h
@@ -116,7 +116,8 @@
 	DEF_PAGEFLAG_NAME(head),					\
 	DEF_PAGEFLAG_NAME(reclaim),					\
 	DEF_PAGEFLAG_NAME(swapbacked),					\
-	DEF_PAGEFLAG_NAME(unevictable)					\
+	DEF_PAGEFLAG_NAME(unevictable),					\
+	DEF_PAGEFLAG_NAME(uncached)					\
 IF_HAVE_PG_MLOCK(mlocked)						\
 IF_HAVE_PG_HWPOISON(hwpoison)						\
 IF_HAVE_PG_IDLE(idle)							\
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 04/15] mm/readahead: add readahead_control->uncached member
Date: Sun, 10 Nov 2024 08:27:56 -0700
Message-ID: <20241110152906.1747545-5-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

If ractl->uncached is set to true, then folios created are marked as
uncached as well.

Signed-off-by: Jens Axboe
---
 include/linux/pagemap.h | 1 +
 mm/readahead.c          | 8 +++++++-
 2 files changed, 8 insertions(+), 1 deletion(-)

diff --git a/include/linux/pagemap.h b/include/linux/pagemap.h
index 68a5f1ff3301..8afacb7520d4 100644
--- a/include/linux/pagemap.h
+++ b/include/linux/pagemap.h
@@ -1350,6 +1350,7 @@ struct readahead_control {
 	pgoff_t _index;
 	unsigned int _nr_pages;
 	unsigned int _batch_count;
+	bool uncached;
 	bool _workingset;
 	unsigned long _pflags;
 };
diff --git a/mm/readahead.c b/mm/readahead.c
index 003cfe79880d..8dbeab9bc1f0 100644
--- a/mm/readahead.c
+++ b/mm/readahead.c
@@ -191,7 +191,13 @@ static void read_pages(struct readahead_control *rac)
 static struct folio *ractl_alloc_folio(struct readahead_control *ractl,
 				       gfp_t gfp_mask, unsigned int order)
 {
-	return filemap_alloc_folio(gfp_mask, order);
+	struct folio *folio;
+
+	folio = filemap_alloc_folio(gfp_mask, order);
+	if (folio && ractl->uncached)
+		__folio_set_uncached(folio);
+
+	return folio;
 }
 
 /**
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 05/15] mm/filemap: use page_cache_sync_ra() to kick off read-ahead
Date: Sun, 10 Nov 2024 08:27:57 -0700
Message-ID: <20241110152906.1747545-6-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

Rather than use the page_cache_sync_readahead() helper, define our own
ractl and use page_cache_sync_ra() directly.
In preparation for needing to modify ractl inside filemap_get_pages().

No functional changes in this patch.

Signed-off-by: Jens Axboe
---
 mm/filemap.c | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/mm/filemap.c b/mm/filemap.c
index 0b187938b999..38dc94b761b7 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -2528,7 +2528,6 @@ static int filemap_get_pages(struct kiocb *iocb, size_t count,
 {
 	struct file *filp = iocb->ki_filp;
 	struct address_space *mapping = filp->f_mapping;
-	struct file_ra_state *ra = &filp->f_ra;
 	pgoff_t index = iocb->ki_pos >> PAGE_SHIFT;
 	pgoff_t last_index;
 	struct folio *folio;
@@ -2543,12 +2542,13 @@ static int filemap_get_pages(struct kiocb *iocb, size_t count,
 
 	filemap_get_read_batch(mapping, index, last_index - 1, fbatch);
 	if (!folio_batch_count(fbatch)) {
+		DEFINE_READAHEAD(ractl, filp, &filp->f_ra, mapping, index);
+
 		if (iocb->ki_flags & IOCB_NOIO)
 			return -EAGAIN;
 		if (iocb->ki_flags & IOCB_NOWAIT)
 			flags = memalloc_noio_save();
-		page_cache_sync_readahead(mapping, ra, filp, index,
-				last_index - index);
+		page_cache_sync_ra(&ractl, last_index - index);
 		if (iocb->ki_flags & IOCB_NOWAIT)
 			memalloc_noio_restore(flags);
 		filemap_get_read_batch(mapping, index, last_index - 1, fbatch);
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 06/15] mm/truncate: make invalidate_complete_folio2() public
Date: Sun, 10 Nov 2024 08:27:58 -0700
Message-ID: <20241110152906.1747545-7-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Transfer-Encoding: 
quoted-printable Content-Type: text/plain; charset="utf-8" Make invalidate_complete_folio2() be publicly available, and have it take a gfp_t mask as well rather than hardcode GFP_KERNEL. The only caller just passes in GFP_KERNEL, no functional changes in this patch. Signed-off-by: Jens Axboe --- include/linux/pagemap.h | 2 ++ mm/truncate.c | 9 +++++---- 2 files changed, 7 insertions(+), 4 deletions(-) diff --git a/include/linux/pagemap.h b/include/linux/pagemap.h index 8afacb7520d4..0122b3fbe2ac 100644 --- a/include/linux/pagemap.h +++ b/include/linux/pagemap.h @@ -34,6 +34,8 @@ int kiocb_invalidate_pages(struct kiocb *iocb, size_t cou= nt); void kiocb_invalidate_post_direct_write(struct kiocb *iocb, size_t count); int filemap_invalidate_pages(struct address_space *mapping, loff_t pos, loff_t end, bool nowait); +int invalidate_complete_folio2(struct address_space *mapping, + struct folio *folio, gfp_t gfp_mask); =20 int write_inode_now(struct inode *, int sync); int filemap_fdatawrite(struct address_space *); diff --git a/mm/truncate.c b/mm/truncate.c index 0668cd340a46..e084f7aa9370 100644 --- a/mm/truncate.c +++ b/mm/truncate.c @@ -546,13 +546,13 @@ EXPORT_SYMBOL(invalidate_mapping_pages); * shrink_folio_list() has a temp ref on them, or because they're transien= tly * sitting in the folio_add_lru() caches. 
*/ -static int invalidate_complete_folio2(struct address_space *mapping, - struct folio *folio) +int invalidate_complete_folio2(struct address_space *mapping, + struct folio *folio, gfp_t gfp_mask) { if (folio->mapping !=3D mapping) return 0; =20 - if (!filemap_release_folio(folio, GFP_KERNEL)) + if (!filemap_release_folio(folio, gfp_mask)) return 0; =20 spin_lock(&mapping->host->i_lock); @@ -650,7 +650,8 @@ int invalidate_inode_pages2_range(struct address_space = *mapping, =20 ret2 =3D folio_launder(mapping, folio); if (ret2 =3D=3D 0) { - if (!invalidate_complete_folio2(mapping, folio)) + if (!invalidate_complete_folio2(mapping, folio, + GFP_KERNEL)) ret2 =3D -EBUSY; } if (ret2 < 0) --=20 2.45.2 From nobody Wed Dec 17 10:44:53 2025 Received: from mail-pj1-f53.google.com (mail-pj1-f53.google.com [209.85.216.53]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 805E11547C8 for ; Sun, 10 Nov 2024 15:29:24 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.53 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1731252565; cv=none; b=m0l3Cjc4F8Iyt4/JSeGskAwBC0xkB5WrsaL4b/miInhPyZJcnTdtgoOgmQDnYbxEeYCfQ44xZKinK4rmZMbC39Rea0ZxU2luKDdVQhq3IVGnYyUFbZCgK9sua7qJEn4sL7HyCXRendrBumb+r4FclbFBq5ixo+c54yZXXj/Jsug= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1731252565; c=relaxed/simple; bh=QqsnT+PFB0stp1XOLxW6GR03t+Q+Rw9k/OtfDS+hMd8=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=gjc57iwA7sOyQDU4AugqjIBp8NxLKms9JE6P4Vh4VslYPorfY9TKa6lZHVjMu8IFmzfM4F4vTxyOk43N6u/hoBGeDFzAsy0Y5S6mJHivX5aIdaSRs718mGEH9tIPKt5gFgGpQsNxFIsSUGhrGxpjXCLVG2mT9yb/OJiAjW77ikA= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk; spf=pass smtp.mailfrom=kernel.dk; dkim=pass (2048-bit key) 
header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b=ITqFiO9j; arc=none smtp.client-ip=209.85.216.53 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=kernel.dk Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b="ITqFiO9j" Received: by mail-pj1-f53.google.com with SMTP id 98e67ed59e1d1-2e2eba31d3aso2824201a91.2 for ; Sun, 10 Nov 2024 07:29:24 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kernel-dk.20230601.gappssmtp.com; s=20230601; t=1731252564; x=1731857364; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=BpaeGG8sCxZf4JRh/p7NGjZl2gbHvBRPjf0TUvSvRrE=; b=ITqFiO9jdn5hfePmhO3CDfJ4/Dmi47iutuGrtDiahBPQhv0vuNt8DvJBOLO3Vxbm3j csJv0r53dA3Iw5r4Oyz+o8HNT+AnndUKoaQQv8Plso7XHK9X5ztaaObQL/PX9x5+v3AK svwxzNw08U3xQl9ftcmwmXj8N0SH1rhpCyajbwU9uOHps8jGHylvxgWT6cEhoORiWMSc DfWarKen3KehFQL5V0oQ4lXV4hOKfHYjbrxiU1wH/GwCNC9ZtZ/ylBhYS2sSgy4LlVIA 7iYkVAJkiGiVf6foE4lxi3B77XWykVOy6Ww0w2X40CY/7+KkhxAXQ/vRX8B39Zjr/M4y by3Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1731252564; x=1731857364; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=BpaeGG8sCxZf4JRh/p7NGjZl2gbHvBRPjf0TUvSvRrE=; b=Jw1NWkoZGSBedFMBBNWb0ryzxoG8LmIxHdk6Sq1CR4qnlJDACMyQP4GU7Ww/P7WlrR sBR8z0MpUhRKfhLK4dgtIlLdCUsZY4Hp4rEkLa38CM75S500vrgzsvhXGDtEqozs2PJK kfBDXHTaXd9pGdhzEtkOqR811StlF9qkq/c2hxmVLXmVr6tZy88ng/2NrSWR6mQ53eKg J/EVXTRdRnf0WK9v/V8LQa8ej94Ljta9edG9rg2XGeQFUxDbjRd2izOAXp7zfrpDZBng 
WPZbbnxSB4BHjgRhwaq8LiddgcXOFRoo2q4mFVC1BF7l/378/8DEYGkUsabNzJz+Ohaq 9Y1g== X-Forwarded-Encrypted: i=1; AJvYcCUOdDXQWCnyld/CeEty2d7+YdG9MjD2oy5BGVYSeOQsWwqQT9v7a1z/yHT3E1oDM+o9LR23YE9PoEj0oTk=@vger.kernel.org X-Gm-Message-State: AOJu0YxXguAxmnwSH/B5yG/0lVh0PiQ8pzvtZQ5k0QjmbKCsHYO8YdoJ 1gBn/9pn+jZBH+WdC96qSvRrchERhwaIuE/Z+rAvWl1F4kUJFwQXMgfqPKu+Op0= X-Google-Smtp-Source: AGHT+IGs3z/1IuRBhhgcQrQmIVWwPr43n/jctzILt+apLswhddJAGBym568yRqijFSP667Ww7gbDZA== X-Received: by 2002:a17:90b:1f8e:b0:2e0:d957:1b9d with SMTP id 98e67ed59e1d1-2e9b17163cbmr13877494a91.13.1731252563753; Sun, 10 Nov 2024 07:29:23 -0800 (PST) Received: from localhost.localdomain ([198.8.77.157]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-2e99a5f935dsm9940973a91.35.2024.11.10.07.29.22 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 10 Nov 2024 07:29:23 -0800 (PST) From: Jens Axboe To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe Subject: [PATCH 07/15] fs: add RWF_UNCACHED iocb and FOP_UNCACHED file_operations flag Date: Sun, 10 Nov 2024 08:27:59 -0700 Message-ID: <20241110152906.1747545-8-axboe@kernel.dk> X-Mailer: git-send-email 2.45.2 In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk> References: <20241110152906.1747545-1-axboe@kernel.dk> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" If a file system supports uncached buffered IO, it may set FOP_UNCACHED and enable RWF_UNCACHED. If RWF_UNCACHED is attempted without the file system supporting it, it'll get errored with -EOPNOTSUPP. 
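To make the -EOPNOTSUPP behaviour concrete, here is a minimal userspace sketch of how a caller might probe for the flag and fall back to a regular buffered read. The helper names `read_maybe_uncached()` and `read_maybe_uncached_demo()` are hypothetical, and the `RWF_UNCACHED` value is simply copied from the uapi hunk in this patch, so it only means something on a kernel carrying this series. Falling back on EINVAL and ENOSYS as well is an extra precaution, not something the patch describes.

```c
#define _GNU_SOURCE		/* for preadv2() */
#include <assert.h>
#include <errno.h>
#include <stdlib.h>
#include <string.h>
#include <sys/uio.h>
#include <unistd.h>

/*
 * Assumption: value copied from the include/uapi/linux/fs.h hunk in this
 * patch. Installed headers may not define it.
 */
#ifndef RWF_UNCACHED
#define RWF_UNCACHED 0x00000080
#endif

/*
 * Hypothetical helper: try an uncached buffered read, and fall back to a
 * plain pread() when the kernel or file system rejects the flag.
 */
static ssize_t read_maybe_uncached(int fd, void *buf, size_t len, off_t off)
{
	struct iovec iov = { .iov_base = buf, .iov_len = len };
	ssize_t ret;

	assert(buf != NULL);
	ret = preadv2(fd, &iov, 1, off, RWF_UNCACHED);
	if (ret < 0 &&
	    (errno == EOPNOTSUPP || errno == EINVAL || errno == ENOSYS))
		ret = pread(fd, buf, len, off);
	return ret;
}

/* Usage sketch: write a scratch file, read it back; returns 0 on success. */
static int read_maybe_uncached_demo(void)
{
	char path[] = "/tmp/uncached-demo-XXXXXX";
	char buf[5];
	int fd = mkstemp(path);
	int ok;

	if (fd < 0)
		return -1;
	ok = write(fd, "hello", 5) == 5 &&
	     read_maybe_uncached(fd, buf, sizeof(buf), 0) == 5 &&
	     memcmp(buf, "hello", 5) == 0;
	close(fd);
	unlink(path);
	return ok ? 0 : -1;
}
```

Because the check sits in kiocb_set_rw_flags(), the error comes back before any IO is issued, so retrying without the flag should be cheap.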
Signed-off-by: Jens Axboe --- include/linux/fs.h | 10 +++++++++- include/uapi/linux/fs.h | 6 +++++- 2 files changed, 14 insertions(+), 2 deletions(-) diff --git a/include/linux/fs.h b/include/linux/fs.h index 3559446279c1..5abc53991cd0 100644 --- a/include/linux/fs.h +++ b/include/linux/fs.h @@ -320,6 +320,7 @@ struct readahead_control; #define IOCB_NOWAIT (__force int) RWF_NOWAIT #define IOCB_APPEND (__force int) RWF_APPEND #define IOCB_ATOMIC (__force int) RWF_ATOMIC +#define IOCB_UNCACHED (__force int) RWF_UNCACHED =20 /* non-RWF related bits - start at 16 */ #define IOCB_EVENTFD (1 << 16) @@ -354,7 +355,8 @@ struct readahead_control; { IOCB_SYNC, "SYNC" }, \ { IOCB_NOWAIT, "NOWAIT" }, \ { IOCB_APPEND, "APPEND" }, \ - { IOCB_ATOMIC, "ATOMIC"}, \ + { IOCB_ATOMIC, "ATOMIC" }, \ + { IOCB_UNCACHED, "UNCACHED" }, \ { IOCB_EVENTFD, "EVENTFD"}, \ { IOCB_DIRECT, "DIRECT" }, \ { IOCB_WRITE, "WRITE" }, \ @@ -2116,6 +2118,8 @@ struct file_operations { #define FOP_HUGE_PAGES ((__force fop_flags_t)(1 << 4)) /* Treat loff_t as unsigned (e.g., /dev/mem) */ #define FOP_UNSIGNED_OFFSET ((__force fop_flags_t)(1 << 5)) +/* File system supports uncached read/write buffered IO */ +#define FOP_UNCACHED ((__force fop_flags_t)(1 << 6)) =20 /* Wrap a directory iterator that needs exclusive inode access */ int wrap_directory_iterator(struct file *, struct dir_context *, @@ -3532,6 +3536,10 @@ static inline int kiocb_set_rw_flags(struct kiocb *k= i, rwf_t flags, if (!(ki->ki_filp->f_mode & FMODE_CAN_ATOMIC_WRITE)) return -EOPNOTSUPP; } + if (flags & RWF_UNCACHED) { + if (!(ki->ki_filp->f_op->fop_flags & FOP_UNCACHED)) + return -EOPNOTSUPP; + } kiocb_flags |=3D (__force int) (flags & RWF_SUPPORTED); if (flags & RWF_SYNC) kiocb_flags |=3D IOCB_DSYNC; diff --git a/include/uapi/linux/fs.h b/include/uapi/linux/fs.h index 753971770733..dc77cd8ae1a3 100644 --- a/include/uapi/linux/fs.h +++ b/include/uapi/linux/fs.h @@ -332,9 +332,13 @@ typedef int __bitwise __kernel_rwf_t; /* Atomic Write */ 
#define RWF_ATOMIC ((__force __kernel_rwf_t)0x00000040) =20 +/* buffered IO that drops the cache after reading or writing data */ +#define RWF_UNCACHED ((__force __kernel_rwf_t)0x00000080) + /* mask of flags supported by the kernel */ #define RWF_SUPPORTED (RWF_HIPRI | RWF_DSYNC | RWF_SYNC | RWF_NOWAIT |\ - RWF_APPEND | RWF_NOAPPEND | RWF_ATOMIC) + RWF_APPEND | RWF_NOAPPEND | RWF_ATOMIC |\ + RWF_UNCACHED) =20 #define PROCFS_IOCTL_MAGIC 'f' =20 --=20 2.45.2 From nobody Wed Dec 17 10:44:53 2025 Received: from mail-pj1-f52.google.com (mail-pj1-f52.google.com [209.85.216.52]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id F375D156F55 for ; Sun, 10 Nov 2024 15:29:25 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.52 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1731252568; cv=none; b=rJ4yZAVMFjRw+YYVQAeIZVDoPPF3lNJWk0oONDCfXmT7mbvEAWN/XnoVXNl7PIRGCPFkMgq9RUzmMsK9ZkWOltWS9qpLJYP2xgQjcqfg3Wnx9rxoo29praWwuLls6mOKTYBsy3GJALkrsUzz/SAdQULa5jHgKNvW2ejbWaBMDeQ= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1731252568; c=relaxed/simple; bh=GtEZ1vh2U+C3BSovAlp6aWdv1IA1Yxh/R7JebXsMPYk=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=gM5MTfU2yRUqR/IoVrNWsNH8yyxyfv0pfgN7eqmJ+9n6GzT7A4Nc2hzBTXVWidqHwiye/Z5ML5MUgwdnNM00lAl2q3ysjejzC9QeZV0eqAP3LloXbJ9YvR6IA1IXmUXRz3XTY2M3v9oLE5SK2uiBg969tKlqsSlOT7mJYh7BYbk= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk; spf=pass smtp.mailfrom=kernel.dk; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b=HHaWkcpa; arc=none smtp.client-ip=209.85.216.52 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk Authentication-Results: 
smtp.subspace.kernel.org; spf=pass smtp.mailfrom=kernel.dk Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b="HHaWkcpa" Received: by mail-pj1-f52.google.com with SMTP id 98e67ed59e1d1-2e2a999b287so3001954a91.0 for ; Sun, 10 Nov 2024 07:29:25 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kernel-dk.20230601.gappssmtp.com; s=20230601; t=1731252565; x=1731857365; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=f2GrDSrtQqt8cMSioXW5eWMYaGSDX9LD8WbU7yXVjqU=; b=HHaWkcpaZkDLnKu53ZAGR9OB/MAQgZlLs61GpZhp9XOUDwp0vjTxEsTwscDQ9z9Gnp SOgjDyWlUXTFMieY/hjSZM2vRS51GGsBObOTwjUP4AWPbHxmJA1Tx1Q7EIuPfXwXcRTs /s9HKiYJMd5pH8ng5ngieLvwhWmkMCwguQy2yyPDTNMTXvy+9Rhqr8IOOwLHNr0aB73x KwWJHy7Owdp77qRB5HtDHb+MlhHt6GBFjfk/DtpKHW2oxSAOhmFQetclgajE1hrUR1j3 ryEa+/KaaacpqB3fo9+k3R+ZlkuIHuxJOkT8dXl/rsz6rAwHE7FO6ITIQqSv+1P+6TxN n8pQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1731252565; x=1731857365; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=f2GrDSrtQqt8cMSioXW5eWMYaGSDX9LD8WbU7yXVjqU=; b=q2TA9u3PLV0gj5aVNNz2ZqgYzySWljcLpx+Xir2wjAv+T5+qOBOm+r28F6ndCFaIQ7 Q/unm9oc4SCxYEGLZnKL0ujHLY99Eg3qZOEL9EmE71AkJc9qOxij0vcYYknMLq0iWYFX Q8KluLyB1rlh4+4acfdI6jVHcb8fgqxXKA79AO/wiw9O/70cZNsDwUyFSn/fHoVDInlq ew1ePlUVWqi1A3v9yKZPfLg7pebbHgQAi+6M8HFMsbu7mv6e05Wu/NIdnJSOownwa0wK UqtgyaBIvo9bwqgbXPlfQt2+Q5KS4MxECc9M5qgY9ENTtd+wQAh7VqBVo3gWXAfCZTXT qDFg== X-Forwarded-Encrypted: i=1; AJvYcCU2ZiBwqbZzW91ATyxJ0JQF5gAAgtJ6Wyh+UbGzvpoas3/68TXGfOwn0olqjh6BZKuN+VX9LM5nX5u8lxU=@vger.kernel.org X-Gm-Message-State: AOJu0YysDZfG4X7UHNVUxw3V6S4i4QWuLS58iJovLhcCKWH6XMC1s73R 
Y6dX5pziItyaHj56BDk+3QYQ8witu/alduJYgVO6MLdYid6wSetXM7qB+2eEJn8=
X-Google-Smtp-Source: AGHT+IHD5ZOaM1/ydWmKI6E2td5s5ih4bl0qHFridFSAcX9UJ78BlPPQ+BWJS6tide+7RYqVyEL5yA==
X-Received: by 2002:a17:90b:2748:b0:2d8:840b:9654 with SMTP id 98e67ed59e1d1-2e9b1754affmr13371709a91.34.1731252565293; Sun, 10 Nov 2024 07:29:25 -0800 (PST)
Received: from localhost.localdomain ([198.8.77.157]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-2e99a5f935dsm9940973a91.35.2024.11.10.07.29.23 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 10 Nov 2024 07:29:24 -0800 (PST)
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 08/15] mm/filemap: add read support for RWF_UNCACHED
Date: Sun, 10 Nov 2024 08:28:00 -0700
Message-ID: <20241110152906.1747545-9-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
List-Id:
List-Subscribe:
List-Unsubscribe:
MIME-Version: 1.0
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain; charset="utf-8"

Add RWF_UNCACHED as a read operation flag, which means that any data read
will be removed from the page cache upon completion. Uses the page cache
to synchronize, and simply prunes folios that were instantiated when the
operation completes. While it would be possible to use private pages for
this, using the page cache as synchronization is handy for a variety of
reasons:

1) No special truncate magic is needed
2) Async buffered reads need some place to serialize, using the page
   cache is a lot easier than writing extra code for this
3) The pruning cost is pretty reasonable

and the code to support this is much simpler as a result.
You can think of uncached buffered IO as being the much more attractive
cousin of O_DIRECT - it has none of the restrictions of O_DIRECT. Yes, it
will copy the data, but unlike regular buffered IO, it doesn't run into
the unpredictability of the page cache in terms of reclaim. As an example,
on a test box with 32 drives, reading them with buffered IO looks as
follows:

Reading bs 65536, uncached 0
  1s: 145945MB/sec
  2s: 158067MB/sec
  3s: 157007MB/sec
  4s: 148622MB/sec
  5s: 118824MB/sec
  6s: 70494MB/sec
  7s: 41754MB/sec
  8s: 90811MB/sec
  9s: 92204MB/sec
 10s: 95178MB/sec
 11s: 95488MB/sec
 12s: 95552MB/sec
 13s: 96275MB/sec

where it's quite easy to see where the page cache filled up, and
performance went from good to erratic, and finally settled at a much
lower rate. Looking at top while this is ongoing, we see:

  PID USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND
 7535 root      20   0  267004      0      0 S  3199   0.0   8:40.65 uncached
 3326 root      20   0       0      0      0 R 100.0   0.0   0:16.40 kswapd4
 3327 root      20   0       0      0      0 R 100.0   0.0   0:17.22 kswapd5
 3328 root      20   0       0      0      0 R 100.0   0.0   0:13.29 kswapd6
 3332 root      20   0       0      0      0 R 100.0   0.0   0:11.11 kswapd10
 3339 root      20   0       0      0      0 R 100.0   0.0   0:16.25 kswapd17
 3348 root      20   0       0      0      0 R 100.0   0.0   0:16.40 kswapd26
 3343 root      20   0       0      0      0 R 100.0   0.0   0:16.30 kswapd21
 3344 root      20   0       0      0      0 R 100.0   0.0   0:11.92 kswapd22
 3349 root      20   0       0      0      0 R 100.0   0.0   0:16.28 kswapd27
 3352 root      20   0       0      0      0 R  99.7   0.0   0:11.89 kswapd30
 3353 root      20   0       0      0      0 R  96.7   0.0   0:16.04 kswapd31
 3329 root      20   0       0      0      0 R  96.4   0.0   0:11.41 kswapd7
 3345 root      20   0       0      0      0 R  96.4   0.0   0:13.40 kswapd23
 3330 root      20   0       0      0      0 S  91.1   0.0   0:08.28 kswapd8
 3350 root      20   0       0      0      0 S  86.8   0.0   0:11.13 kswapd28
 3325 root      20   0       0      0      0 S  76.3   0.0   0:07.43 kswapd3
 3341 root      20   0       0      0      0 S  74.7   0.0   0:08.85 kswapd19
 3334 root      20   0       0      0      0 S  71.7   0.0   0:10.04 kswapd12
 3351 root      20   0       0      0      0 R  60.5   0.0   0:09.59 kswapd29
 3323 root      20   0       0      0      0 R  57.6   0.0   0:11.50 kswapd1
[...]
which is just showing a partial list of the 32 kswapd threads that are
running mostly full tilt, burning ~28 full CPU cores.

If the same test case is run with RWF_UNCACHED set for the buffered read,
the output looks as follows:

Reading bs 65536, uncached 0
  1s: 153144MB/sec
  2s: 156760MB/sec
  3s: 158110MB/sec
  4s: 158009MB/sec
  5s: 158043MB/sec
  6s: 157638MB/sec
  7s: 157999MB/sec
  8s: 158024MB/sec
  9s: 157764MB/sec
 10s: 157477MB/sec
 11s: 157417MB/sec
 12s: 157455MB/sec
 13s: 157233MB/sec
 14s: 156692MB/sec

which is just chugging along at ~155GB/sec of read performance. Looking
at top, we see:

  PID USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND
 7961 root      20   0  267004      0      0 S  3180   0.0   5:37.95 uncached
 8024 axboe     20   0   14292   4096      0 R   1.0   0.0   0:00.13 top

where just the test app is using CPU, no reclaim is taking place outside
of the main thread. Not only is performance 65% better, it's also using
half the CPU to do it.

Signed-off-by: Jens Axboe
---
 mm/filemap.c | 18 ++++++++++++++++--
 mm/swap.c    |  2 ++
 2 files changed, 18 insertions(+), 2 deletions(-)

diff --git a/mm/filemap.c b/mm/filemap.c
index 38dc94b761b7..bd698340ef24 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -2474,6 +2474,8 @@ static int filemap_create_folio(struct kiocb *iocb,
 	folio = filemap_alloc_folio(mapping_gfp_mask(mapping), min_order);
 	if (!folio)
 		return -ENOMEM;
+	if (iocb->ki_flags & IOCB_UNCACHED)
+		__folio_set_uncached(folio);
 
 	/*
 	 * Protect against truncate / hole punch.
Grabbing invalidate_lock @@ -2519,6 +2521,8 @@ static int filemap_readahead(struct kiocb *iocb, stru= ct file *file, =20 if (iocb->ki_flags & IOCB_NOIO) return -EAGAIN; + if (iocb->ki_flags & IOCB_UNCACHED) + ractl.uncached =3D 1; page_cache_async_ra(&ractl, folio, last_index - folio->index); return 0; } @@ -2548,6 +2552,8 @@ static int filemap_get_pages(struct kiocb *iocb, size= _t count, return -EAGAIN; if (iocb->ki_flags & IOCB_NOWAIT) flags =3D memalloc_noio_save(); + if (iocb->ki_flags & IOCB_UNCACHED) + ractl.uncached =3D 1; page_cache_sync_ra(&ractl, last_index - index); if (iocb->ki_flags & IOCB_NOWAIT) memalloc_noio_restore(flags); @@ -2706,8 +2712,16 @@ ssize_t filemap_read(struct kiocb *iocb, struct iov_= iter *iter, } } put_folios: - for (i =3D 0; i < folio_batch_count(&fbatch); i++) - folio_put(fbatch.folios[i]); + for (i =3D 0; i < folio_batch_count(&fbatch); i++) { + struct folio *folio =3D fbatch.folios[i]; + + if (folio_test_uncached(folio)) { + folio_lock(folio); + invalidate_complete_folio2(mapping, folio, 0); + folio_unlock(folio); + } + folio_put(folio); + } folio_batch_init(&fbatch); } while (iov_iter_count(iter) && iocb->ki_pos < isize && !error); =20 diff --git a/mm/swap.c b/mm/swap.c index 835bdf324b76..f2457acae383 100644 --- a/mm/swap.c +++ b/mm/swap.c @@ -472,6 +472,8 @@ static void folio_inc_refs(struct folio *folio) */ void folio_mark_accessed(struct folio *folio) { + if (folio_test_uncached(folio)) + return; if (lru_gen_enabled()) { folio_inc_refs(folio); return; --=20 2.45.2 From nobody Wed Dec 17 10:44:53 2025 Received: from mail-pl1-f176.google.com (mail-pl1-f176.google.com [209.85.214.176]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 46738157495 for ; Sun, 10 Nov 2024 15:29:27 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.214.176 ARC-Seal: i=1; a=rsa-sha256; 
d=subspace.kernel.org; s=arc-20240116; t=1731252569; cv=none; b=r2mZP7l6PckF2uO1c8GtxXCyodJBselXFr7lgeQKLao0wIkI8Bsru0xLETiy+DlPrrTgV25Y16FQwoz4nq08jGsH9hVeqNUo2lcklewNTLQJJOejBUnWdhEROVfHOYFo3qmD4peIHUHgf0/MPUzgbsWdmC0vJyjU55txi//IbH8= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1731252569; c=relaxed/simple; bh=AKt+S745U93BEmbA6t/wcRBIgQj/soY+rT3nJS7qXJU=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=LD4RIP+zzSqoN3B2FslaPh4xMUKPnPkufEjX5gwVsf02GV5R02GXbLhe6v9Pz58g0JjWzCc6N2VL/HPsybsmieSAHpTsmo5qUU2pU9iBOCUk0S1D10htoY5iFC0l4tmH+wziG+e/wRFVt6vZ7RJiee0BJq+k9k//w3XsIKSsGBU= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk; spf=pass smtp.mailfrom=kernel.dk; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b=jpxkdGSA; arc=none smtp.client-ip=209.85.214.176 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=kernel.dk Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b="jpxkdGSA" Received: by mail-pl1-f176.google.com with SMTP id d9443c01a7336-20c9978a221so42261175ad.1 for ; Sun, 10 Nov 2024 07:29:27 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kernel-dk.20230601.gappssmtp.com; s=20230601; t=1731252566; x=1731857366; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=xtqsp3FpRWE9h3nra5m9D1cpUkXI8hkSCKuFCrWSblw=; b=jpxkdGSACE2RIfxZR30I6RLxlbnIVcV8CQP+caD3TJTf9MGEGC2rENCIMrdqFSyPRW sximQLDX7ngw6JwKo6AoBMqN06YroHpC69obYTg9zcR5TfthsZZKuRbYrE4DxXnDpU+W 
yNp+1ynCzu3wN3TExVe/C0PsKAYsSkZusARdnCjiszGVukDMpprVY8wcpxSJn8tJNM/A DojcecxZxsAZafCdfcjlenpPY0f1UhUBl2p0KFkYWWdxG3LXKTHoTUbrm22m5NvlpL9D tfYK5FwXHhCl7N+++ecZbUuTVnkhOQeBJHYZ0QrL8yM7tLJY1HQ1KkcNb8WwDQTc8uV+ pAjw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1731252566; x=1731857366; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=xtqsp3FpRWE9h3nra5m9D1cpUkXI8hkSCKuFCrWSblw=; b=c/J9HfKj0Zz4nLmb2lkZGl2BCKcbfiO/U1zLLyR8gjwQWsyQHfhVIlQph8ceRWj6Yo zIpSgR2Un1m9xjDKVvjL+TBc6t0f6zNGGDAwRqJktLpNwE12XQSykGDag7+suPIF1TYb VF+5gZzm9Bx6xHETFCPifnPwQk1pKpVWy2quQUrGRp/rGhnFP5XL1wmkyIkKbI3t16SB pOS+ZtynNgfeO3ORjj693yRDSPBsHw/PKQVRWOlMrsVDpyhvNVsuYipVQ12UBgkq39dp GoVDIyTaJpcCGYnuHylZw0fAD9epF7MOMyElTlAJaTmj8bS8iqC3/QVda7Mo7LMfc8pZ pP3w== X-Forwarded-Encrypted: i=1; AJvYcCWFW7s0rMQpjICmIPl9DFfMUoIGEIzNGrQFNgMnOQgMtSFlPnWWOMFe4TeR67bmLour5o8gGb4xh6YOMv0=@vger.kernel.org X-Gm-Message-State: AOJu0YwWy/reIqrfJjqdKR5f+WTm0pMAfC8k+5krfbFKM3s+zBHNkVCz iuImEQjfPUG/RXg+JjzaF1AIQAu1LQLWEP23uxnEUhlpFAdEX5GiVeGSotxCFP4= X-Google-Smtp-Source: AGHT+IF5qTNuztLawL22nFdoWM3hBa8g+iEMn3AucUKZX2m0zot7CflQJgWfjjOvLTZBT0mpwh+/bA== X-Received: by 2002:a17:902:cf02:b0:20c:af07:a816 with SMTP id d9443c01a7336-21183d087c7mr129753345ad.31.1731252566678; Sun, 10 Nov 2024 07:29:26 -0800 (PST) Received: from localhost.localdomain ([198.8.77.157]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-2e99a5f935dsm9940973a91.35.2024.11.10.07.29.25 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 10 Nov 2024 07:29:25 -0800 (PST) From: Jens Axboe To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe Subject: [PATCH 09/15] mm/filemap: drop uncached pages when writeback completes Date: Sun, 10 Nov 2024 08:28:01 -0700 Message-ID: 
<20241110152906.1747545-10-axboe@kernel.dk> X-Mailer: git-send-email 2.45.2 In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk> References: <20241110152906.1747545-1-axboe@kernel.dk> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" If the folio is marked as uncached, drop pages when writeback completes. Intended to be used with RWF_UNCACHED, to avoid needing sync writes for uncached IO. Signed-off-by: Jens Axboe --- mm/filemap.c | 23 +++++++++++++++++++++++ 1 file changed, 23 insertions(+) diff --git a/mm/filemap.c b/mm/filemap.c index bd698340ef24..efd02b047541 100644 --- a/mm/filemap.c +++ b/mm/filemap.c @@ -1600,6 +1600,23 @@ int folio_wait_private_2_killable(struct folio *foli= o) } EXPORT_SYMBOL(folio_wait_private_2_killable); =20 +/* + * If folio was marked as uncached, then pages should be dropped when writ= eback + * completes. Do that now. If we fail, it's likely because of a big folio - + * just reset uncached for that case and latter completions should invalid= ate. + */ +static void folio_end_uncached(struct folio *folio) +{ + bool reset =3D true; + + if (folio_trylock(folio)) { + reset =3D !invalidate_complete_folio2(folio->mapping, folio, 0); + folio_unlock(folio); + } + if (reset) + folio_set_uncached(folio); +} + /** * folio_end_writeback - End writeback against a folio. * @folio: The folio. @@ -1610,6 +1627,8 @@ EXPORT_SYMBOL(folio_wait_private_2_killable); */ void folio_end_writeback(struct folio *folio) { + bool folio_uncached; + VM_BUG_ON_FOLIO(!folio_test_writeback(folio), folio); =20 /* @@ -1631,9 +1650,13 @@ void folio_end_writeback(struct folio *folio) * reused before the folio_wake_bit(). 
*/ folio_get(folio); + folio_uncached =3D folio_test_clear_uncached(folio); if (__folio_end_writeback(folio)) folio_wake_bit(folio, PG_writeback); acct_reclaim_writeback(folio); + + if (folio_uncached) + folio_end_uncached(folio); folio_put(folio); } EXPORT_SYMBOL(folio_end_writeback); --=20 2.45.2 From nobody Wed Dec 17 10:44:53 2025 Received: from mail-pl1-f178.google.com (mail-pl1-f178.google.com [209.85.214.178]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id DBC38158208 for ; Sun, 10 Nov 2024 15:29:28 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.214.178 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1731252570; cv=none; b=HqhdAidESNrq58qb2aWMF87aIPXxn/9baZrkPJjwMtbfKiLea4tgdBuiJ6+wgBLbHOqDl5uZKx81n4eeObwx6KKgq25UZLaxAaa1kkbqzeWK+ZKqFo6VJADuS7K14eUIoo8DJ1NOYjTovpe2ieI6eEEj/LUv5j9+oK0/m38ib9w= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1731252570; c=relaxed/simple; bh=DvOUyhYLGoxN2Hc3acnkKQpks6bE7rAJDoBJQe3HSFQ=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=hXJsJS5RS5TYJlg3YCaF4Srw2UGLFNr6MNo+L4FIJuZrSp2hTc7Y4ty+DvWM5fj95WqtddqzuIGmbsJr1vd08vnEPkINtV6rdlr8qUjXHskFvKX0zUA3DEAWrqMTlhVmJwFm4CTNtm1IepnzAX0oywgc3nwrLatOnyLeQXs+Ikg= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk; spf=pass smtp.mailfrom=kernel.dk; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b=jDTXabpf; arc=none smtp.client-ip=209.85.214.178 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=kernel.dk Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) 
header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b="jDTXabpf" Received: by mail-pl1-f178.google.com with SMTP id d9443c01a7336-20e6981ca77so42845775ad.2 for ; Sun, 10 Nov 2024 07:29:28 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kernel-dk.20230601.gappssmtp.com; s=20230601; t=1731252568; x=1731857368; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=c6ru6+H4oQlmjhy4ZS59TJ0rgrpyxDTaEbRj/Vr+gBQ=; b=jDTXabpfNmkfTfN2pKKft5LnVS8ofQE3fRSSULoPO1MAPz4TIQ+PKVaf7z0nIi0kSO KvragGR9eX+cguqb+WAeeso1/5o2cWWY6UyU6Hx6wZA3vNGcq5XOUR/iWOTYfYhABKvp KZRFEfWMokybmbHgkrKIRLqw4ChQvepExYoYND9RJ9OnCER2tIexck99O/ZvAZTa3Buc CPoGYkOnUEoEx4kbQ1ir7xeL0g3xVNL0S/AmGa9T8ktMxatRNVoDfc1jW0R2MtFnEErs rOxpD/plCD7khf76VgDabfo57/lzrl+xXlCt+c8ofJpdhN2AswS/fI/cAQJjgbAuqfv2 gsbg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1731252568; x=1731857368; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=c6ru6+H4oQlmjhy4ZS59TJ0rgrpyxDTaEbRj/Vr+gBQ=; b=PosRisWisVHd7nav/MnCqyWyW/Zrn/fe5T5UmTf0vKZDGHxB11Wyv0r9N3sPQwKKlg 2TmI2UafkReB3c/ExWtCxvD9msXWm4BQv9LJJ0hHGwdUptRNVvXawa+ym0agopcQ5SK0 BiNTmFPA90hdirRim4JL/ZgOtz6gDwdMq79kWRguDiXPQQd8EY1EIvUcT+R2+JA5AHB0 JshTx3J+xWfHZrPkMYdNM7tzGDcSzb/Jhh2MhJt0GSL4zumPtyZ14/hPJfNfTvawmZXw hRjIOSxUGigLO8TIrfEhKDhu92sRzE91y0gaT2zZR62kFlIWaLXi0Gv5Rl0xt19BizzR zmSw== X-Forwarded-Encrypted: i=1; AJvYcCUXtptQ/U6c1UKaUslNoW9IUpNW1awHNAedQNeqMrhuTEOaVUqJ1NTcrA03HbmkA9gu4oCE9Qy6LK8IqTk=@vger.kernel.org X-Gm-Message-State: AOJu0YzOpYU8Tp9llqW8PzgPYO5p5oiog1Hs0qeOLpxYgR3zcGSZghqb 7oJQDBVHMPYeG3iDn1KdS6f497hKow5oaOGlOjXBsCPiLlY6wDAOcLC7O4QRsLI= X-Google-Smtp-Source: AGHT+IEOVFhFei02kQq1ltQWyF+j1C3XIjJZGZV1LAM2AT1TQ/js4+EAJqEYuNTXGh31ae8isAOkHA== 
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 10/15] mm/filemap: make buffered writes work with RWF_UNCACHED
Date: Sun, 10 Nov 2024 08:28:02 -0700
Message-ID: <20241110152906.1747545-11-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

If RWF_UNCACHED is set for a write, mark new folios being written as
uncached. This is done by passing in the fact that it's an uncached write
through the folio pointer. We can only get there when IOCB_UNCACHED was
allowed, which can only happen if the file system opts in. Opting in means
the file system needs to check whether the folio pointer passed in to
->write_begin() is set to foliop_uncached to know if it's an uncached
write or not. If it is, then FGP_UNCACHED should be used if creating new
folios is necessary.

Uncached writes will drop any folios they create upon writeback
completion, but leave folios that may already exist in that range alone.
Since ->write_begin() doesn't currently take any flags, and to avoid
needing to change the callback kernel wide, use the foliop being passed in
to ->write_begin() to signal if this is an uncached write or not. File
systems can then use that to mark newly created folios as uncached.
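
For illustration only (this is not part of the patch): a minimal userspace
sketch of the signalling described above, where an in/out folio pointer is
seeded with a sentinel value that the callee checks before overwriting it.
The struct, the MODEL_* flag values and the model_* functions are
hypothetical stand-ins, and the model always "creates" a folio, which the
real lookup path does not.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Stand-in type for this userspace model; not the kernel's struct folio. */
struct folio {
	int uncached;
};

/*
 * Sentinel value as used in the patch. The uintptr_t cast is added here
 * only to keep userspace compilers quiet. The value is compared, never
 * dereferenced.
 */
#define foliop_uncached			((struct folio *)(uintptr_t)0xfee1c001u)
#define foliop_is_uncached(foliop)	(*(foliop) == foliop_uncached)

/* Hypothetical flag bits for the model (not the kernel's fgf_t values). */
#define MODEL_FGP_WRITEBEGIN	0x001u
#define MODEL_FGP_UNCACHED	0x100u

static struct folio model_storage;	/* the one folio this model hands out */

/*
 * Hypothetical ->write_begin(): read the hint from *foliop first, then
 * overwrite *foliop with the folio that was "created".
 */
static unsigned int model_write_begin(struct folio **foliop)
{
	unsigned int flags = MODEL_FGP_WRITEBEGIN;

	if (foliop_is_uncached(foliop))
		flags |= MODEL_FGP_UNCACHED;

	model_storage.uncached = (flags & MODEL_FGP_UNCACHED) != 0;
	*foliop = &model_storage;
	return flags;
}

/* Caller side: seed the pointer with NULL or the sentinel before the call. */
static unsigned int model_perform_write(int uncached_write, struct folio **out)
{
	struct folio *folio = NULL;
	unsigned int flags;

	if (uncached_write)
		folio = foliop_uncached;

	flags = model_write_begin(&folio);
	*out = folio;
	return flags;
}
```

The point of the design is that the callee must look at the incoming
pointer value before storing its result, since both travel through the
same argument.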

Add a helper, generic_uncached_write(), that generic_file_write_iter()
calls upon successful completion of an uncached write.

This provides similar benefits to using RWF_UNCACHED with reads. Testing
buffered writes on 32 files:

writing bs 65536, uncached 0
  1s: 196035MB/sec, MB=196035
  2s: 132308MB/sec, MB=328147
  3s: 132438MB/sec, MB=460586
  4s: 116528MB/sec, MB=577115
  5s: 103898MB/sec, MB=681014
  6s: 108893MB/sec, MB=789907
  7s: 99678MB/sec, MB=889586
  8s: 106545MB/sec, MB=996132
  9s: 106826MB/sec, MB=1102958
 10s: 101544MB/sec, MB=1204503
 11s: 111044MB/sec, MB=1315548
 12s: 124257MB/sec, MB=1441121
 13s: 116031MB/sec, MB=1557153
 14s: 114540MB/sec, MB=1671694
 15s: 115011MB/sec, MB=1786705
 16s: 115260MB/sec, MB=1901966
 17s: 116068MB/sec, MB=2018034
 18s: 116096MB/sec, MB=2134131

where it's quite obvious where the page cache filled, and performance
dropped to about half of where it started, settling in at around
115GB/sec. Meanwhile, 32 kswapds were running full steam trying to
reclaim pages.

Running the same test with uncached buffered writes:

writing bs 65536, uncached 1
  1s: 198974MB/sec
  2s: 189618MB/sec
  3s: 193601MB/sec
  4s: 188582MB/sec
  5s: 193487MB/sec
  6s: 188341MB/sec
  7s: 194325MB/sec
  8s: 188114MB/sec
  9s: 192740MB/sec
 10s: 189206MB/sec
 11s: 193442MB/sec
 12s: 189659MB/sec
 13s: 191732MB/sec
 14s: 190701MB/sec
 15s: 191789MB/sec
 16s: 191259MB/sec
 17s: 190613MB/sec
 18s: 191951MB/sec

and the behavior is fully predictable, performing the same throughout
even after the page cache would otherwise have fully filled with dirty
data. It's also about 65% faster, and using half the CPU of the system
compared to the normal buffered write.
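
As a cross-check of the "about 65% faster" figure (illustration only, not
part of the patch), the steady-state throughput of the two runs above can
be compared directly. The sketch below assumes that seconds 13 through 18
are a fair steady-state window; that window, and the helper names, are
choices made for this example.

```c
#include <assert.h>
#include <stddef.h>

/* Samples in MB/sec for seconds 13..18, copied from the two runs above. */
static const double cached_mbps[] = {
	116031, 114540, 115011, 115260, 116068, 116096
};
static const double uncached_mbps[] = {
	191732, 190701, 191789, 191259, 190613, 191951
};

/* Arithmetic mean of n samples. */
static double mean(const double *v, size_t n)
{
	double sum = 0.0;
	size_t i;

	for (i = 0; i < n; i++)
		sum += v[i];
	return sum / (double)n;
}

/* Percentage by which throughput b exceeds throughput a. */
static double percent_faster(double a, double b)
{
	return (b / a - 1.0) * 100.0;
}

/* Gain of the uncached run over the cached run in the chosen window. */
static double steady_state_gain(void)
{
	size_t n = sizeof(cached_mbps) / sizeof(cached_mbps[0]);

	return percent_faster(mean(cached_mbps, n), mean(uncached_mbps, n));
}
```

With this window the means come out at roughly 115.5GB/sec and
191.3GB/sec, which is consistent with the figure quoted above.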

Signed-off-by: Jens Axboe
---
 include/linux/pagemap.h | 29 +++++++++++++++++++++++++++++
 mm/filemap.c            | 26 +++++++++++++++++++++++---
 2 files changed, 52 insertions(+), 3 deletions(-)

diff --git a/include/linux/pagemap.h b/include/linux/pagemap.h
index 0122b3fbe2ac..5469664f66c3 100644
--- a/include/linux/pagemap.h
+++ b/include/linux/pagemap.h
@@ -14,6 +14,7 @@
 #include
 #include
 #include /* for in_interrupt() */
+#include
 #include

 struct folio_batch;
@@ -70,6 +71,34 @@ static inline int filemap_write_and_wait(struct address_space *mapping)
 	return filemap_write_and_wait_range(mapping, 0, LLONG_MAX);
 }

+/*
+ * generic_uncached_write - start uncached writeback
+ * @iocb: the iocb that was written
+ * @written: the amount of bytes written
+ *
+ * When writeback has been handled by write_iter, this helper should be called
+ * if the file system supports uncached writes. If %IOCB_UNCACHED is set, it
+ * will kick off writeback for the specified range.
+ */
+static inline void generic_uncached_write(struct kiocb *iocb, ssize_t written)
+{
+	if (iocb->ki_flags & IOCB_UNCACHED) {
+		struct address_space *mapping = iocb->ki_filp->f_mapping;
+
+		/* kick off uncached writeback */
+		__filemap_fdatawrite_range(mapping, iocb->ki_pos,
+					   iocb->ki_pos + written, WB_SYNC_NONE);
+	}
+}
+
+/*
+ * Value passed in to ->write_begin() if IOCB_UNCACHED is set for the write,
+ * and the ->write_begin() handler on a file system supporting FOP_UNCACHED
+ * must check for this and pass FGP_UNCACHED for folio creation.
+ */
+#define foliop_uncached			((struct folio *) 0xfee1c001)
+#define foliop_is_uncached(foliop)	(*(foliop) == foliop_uncached)
+
 /**
  * filemap_set_wb_err - set a writeback error on an address_space
  * @mapping: mapping in which to set writeback error
diff --git a/mm/filemap.c b/mm/filemap.c
index efd02b047541..cfbfc8b14b1f 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -430,6 +430,7 @@ int __filemap_fdatawrite_range(struct address_space *mapping, loff_t start,

 	return filemap_fdatawrite_wbc(mapping, &wbc);
 }
+EXPORT_SYMBOL_GPL(__filemap_fdatawrite_range);

 static inline int __filemap_fdatawrite(struct address_space *mapping,
 	int sync_mode)
@@ -1609,7 +1610,14 @@ static void folio_end_uncached(struct folio *folio)
 {
 	bool reset = true;

-	if (folio_trylock(folio)) {
+	/*
+	 * Hitting !in_task() should not happen off RWF_UNCACHED writeback, but
+	 * can happen if normal writeback just happens to find dirty folios
+	 * that were created as part of uncached writeback, and that writeback
+	 * would otherwise not need non-IRQ handling. Just skip the
+	 * invalidation in that case.
+	 */
+	if (in_task() && folio_trylock(folio)) {
 		reset = !invalidate_complete_folio2(folio->mapping, folio, 0);
 		folio_unlock(folio);
 	}
@@ -4061,7 +4069,7 @@ ssize_t generic_perform_write(struct kiocb *iocb, struct iov_iter *i)
 	ssize_t written = 0;

 	do {
-		struct folio *folio;
+		struct folio *folio = NULL;
 		size_t offset;		/* Offset into folio */
 		size_t bytes;		/* Bytes to write to folio */
 		size_t copied;		/* Bytes copied from user */
@@ -4089,6 +4097,16 @@ ssize_t generic_perform_write(struct kiocb *iocb, struct iov_iter *i)
 			break;
 		}

+		/*
+		 * If IOCB_UNCACHED is set here, we know the file system
+		 * supports it. And hence it'll know to check foliop for being
+		 * set to this magic value. If so, it's an uncached write.
+		 * Whenever ->write_begin() changes prototypes again, this
+		 * can go away and just pass iocb or iocb flags.
+		 */
+		if (iocb->ki_flags & IOCB_UNCACHED)
+			folio = foliop_uncached;
+
 		status = a_ops->write_begin(file, mapping, pos, bytes,
 						&folio, &fsdata);
 		if (unlikely(status < 0))
@@ -4219,8 +4237,10 @@ ssize_t generic_file_write_iter(struct kiocb *iocb, struct iov_iter *from)
 	ret = __generic_file_write_iter(iocb, from);
 	inode_unlock(inode);

-	if (ret > 0)
+	if (ret > 0) {
+		generic_uncached_write(iocb, ret);
 		ret = generic_write_sync(iocb, ret);
+	}
 	return ret;
 }
 EXPORT_SYMBOL(generic_file_write_iter);
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 11/15] mm: add FGP_UNCACHED folio creation flag
Date: Sun, 10 Nov 2024 08:28:03 -0700
Message-ID: <20241110152906.1747545-12-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

Callers can pass this in for uncached folio creation, in which case if
a folio is newly created it gets marked as uncached. If a folio exists
for this index and lookup succeeds, then it will not get marked as
uncached.

Signed-off-by: Jens Axboe
---
 include/linux/pagemap.h | 2 ++
 mm/filemap.c            | 2 ++
 2 files changed, 4 insertions(+)

diff --git a/include/linux/pagemap.h b/include/linux/pagemap.h
index 5469664f66c3..de0ed906cd2d 100644
--- a/include/linux/pagemap.h
+++ b/include/linux/pagemap.h
@@ -741,6 +741,7 @@ pgoff_t page_cache_prev_miss(struct address_space *mapping,
  * * %FGP_NOFS - __GFP_FS will get cleared in gfp.
  * * %FGP_NOWAIT - Don't block on the folio lock.
  * * %FGP_STABLE - Wait for the folio to be stable (finished writeback)
+ * * %FGP_UNCACHED - Uncached buffered IO
  * * %FGP_WRITEBEGIN - The flags to use in a filesystem write_begin()
  *   implementation.
  */
@@ -754,6 +755,7 @@ typedef unsigned int __bitwise fgf_t;
 #define FGP_NOWAIT		((__force fgf_t)0x00000020)
 #define FGP_FOR_MMAP		((__force fgf_t)0x00000040)
 #define FGP_STABLE		((__force fgf_t)0x00000080)
+#define FGP_UNCACHED		((__force fgf_t)0x00000100)
 #define FGF_GET_ORDER(fgf)	(((__force unsigned)fgf) >> 26)	/* top 6 bits */

 #define FGP_WRITEBEGIN		(FGP_LOCK | FGP_WRITE | FGP_CREAT | FGP_STABLE)
diff --git a/mm/filemap.c b/mm/filemap.c
index cfbfc8b14b1f..4fdf3c4ae00f 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -1987,6 +1987,8 @@ struct folio *__filemap_get_folio(struct address_space *mapping, pgoff_t index,
 			/* Init accessed so avoid atomic mark_page_accessed later */
 			if (fgp_flags & FGP_ACCESSED)
 				__folio_set_referenced(folio);
+			if (fgp_flags & FGP_UNCACHED)
+				__folio_set_uncached(folio);

 			err = filemap_add_folio(mapping, folio, index, gfp);
 			if (!err)
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 12/15] ext4: add RWF_UNCACHED write support
Date: Sun, 10 Nov 2024 08:28:04 -0700
Message-ID: <20241110152906.1747545-13-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

IOCB_UNCACHED IO needs to prune writeback regions on IO completion, and
hence needs the worker punt that ext4 also does for unwritten extents.
Add an io_end flag to manage that.

If foliop is set to foliop_uncached in ext4_write_begin(), then set
FGP_UNCACHED so that __filemap_get_folio() will mark newly created folios
as uncached. That in turn will make writeback completion drop these
ranges from the page cache.

Now that ext4 supports both uncached reads and writes, add the fop_flag
FOP_UNCACHED to enable it.

Signed-off-by: Jens Axboe
---
 fs/ext4/ext4.h    |  1 +
 fs/ext4/file.c    |  2 +-
 fs/ext4/inline.c  |  7 ++++++-
 fs/ext4/inode.c   | 18 ++++++++++++++++--
 fs/ext4/page-io.c | 28 ++++++++++++++++------------
 5 files changed, 40 insertions(+), 16 deletions(-)

diff --git a/fs/ext4/ext4.h b/fs/ext4/ext4.h
index 44b0d418143c..60dc9ffae076 100644
--- a/fs/ext4/ext4.h
+++ b/fs/ext4/ext4.h
@@ -279,6 +279,7 @@ struct ext4_system_blocks {
  * Flags for ext4_io_end->flags
  */
 #define EXT4_IO_END_UNWRITTEN	0x0001
+#define EXT4_IO_UNCACHED	0x0002

 struct ext4_io_end_vec {
 	struct list_head list;		/* list of io_end_vec */
diff --git a/fs/ext4/file.c b/fs/ext4/file.c
index f14aed14b9cf..0ef39d738598 100644
--- a/fs/ext4/file.c
+++ b/fs/ext4/file.c
@@ -944,7 +944,7 @@ const struct file_operations ext4_file_operations = {
 	.splice_write	= iter_file_splice_write,
 	.fallocate	= ext4_fallocate,
 	.fop_flags	= FOP_MMAP_SYNC | FOP_BUFFER_RASYNC |
-			  FOP_DIO_PARALLEL_WRITE,
+			  FOP_DIO_PARALLEL_WRITE | FOP_UNCACHED,
 };

 const struct inode_operations ext4_file_inode_operations = {
diff --git a/fs/ext4/inline.c b/fs/ext4/inline.c
index 3536ca7e4fcc..4089d0744164 100644
--- a/fs/ext4/inline.c
+++ b/fs/ext4/inline.c
@@ -667,6 +667,7 @@ int ext4_try_to_write_inline_data(struct address_space *mapping,
 	handle_t *handle;
 	struct folio *folio;
 	struct ext4_iloc iloc;
+	fgf_t fgp_flags;

 	if (pos + len > ext4_get_max_inline_size(inode))
 		goto convert;
@@ -702,7 +703,11 @@ int ext4_try_to_write_inline_data(struct address_space *mapping,
 	if (ret)
 		goto out;

-	folio = __filemap_get_folio(mapping, 0, FGP_WRITEBEGIN | FGP_NOFS,
+	fgp_flags = FGP_WRITEBEGIN | FGP_NOFS;
+	if (*foliop == foliop_uncached)
+		fgp_flags |= FGP_UNCACHED;
+
+	folio = __filemap_get_folio(mapping, 0, fgp_flags,
 					mapping_gfp_mask(mapping));
 	if (IS_ERR(folio)) {
 		ret = PTR_ERR(folio);
diff --git a/fs/ext4/inode.c b/fs/ext4/inode.c
index 54bdd4884fe6..afae3ab64c9e 100644
--- a/fs/ext4/inode.c
+++ b/fs/ext4/inode.c
@@ -1138,6 +1138,7 @@ static int ext4_write_begin(struct file *file, struct address_space *mapping,
 	int ret, needed_blocks;
 	handle_t *handle;
 	int retries = 0;
+	fgf_t fgp_flags;
 	struct folio *folio;
 	pgoff_t index;
 	unsigned from, to;
@@ -1164,6 +1165,15 @@ static int ext4_write_begin(struct file *file, struct address_space *mapping,
 			return 0;
 	}

+	/*
+	 * Set FGP_WRITEBEGIN, and FGP_UNCACHED if foliop contains
+	 * foliop_uncached. That's how generic_perform_write() informs us
+	 * that this is an uncached write.
+	 */
+	fgp_flags = FGP_WRITEBEGIN;
+	if (*foliop == foliop_uncached)
+		fgp_flags |= FGP_UNCACHED;
+
 	/*
 	 * __filemap_get_folio() can take a long time if the
 	 * system is thrashing due to memory pressure, or if the folio
@@ -1172,7 +1182,7 @@ static int ext4_write_begin(struct file *file, struct address_space *mapping,
 	 * the folio (if needed) without using GFP_NOFS.
 	 */
 retry_grab:
-	folio = __filemap_get_folio(mapping, index, FGP_WRITEBEGIN,
+	folio = __filemap_get_folio(mapping, index, fgp_flags,
 					mapping_gfp_mask(mapping));
 	if (IS_ERR(folio))
 		return PTR_ERR(folio);
@@ -2903,6 +2913,7 @@ static int ext4_da_write_begin(struct file *file, struct address_space *mapping,
 	struct folio *folio;
 	pgoff_t index;
 	struct inode *inode = mapping->host;
+	fgf_t fgp_flags;

 	if (unlikely(ext4_forced_shutdown(inode->i_sb)))
 		return -EIO;
@@ -2926,8 +2937,11 @@ static int ext4_da_write_begin(struct file *file, struct address_space *mapping,
 			return 0;
 	}

+	fgp_flags = FGP_WRITEBEGIN;
+	if (*foliop == foliop_uncached)
+		fgp_flags |= FGP_UNCACHED;
 retry:
-	folio = __filemap_get_folio(mapping, index, FGP_WRITEBEGIN,
+	folio = __filemap_get_folio(mapping, index, fgp_flags,
 					mapping_gfp_mask(mapping));
 	if (IS_ERR(folio))
 		return PTR_ERR(folio);
diff --git a/fs/ext4/page-io.c b/fs/ext4/page-io.c
index ad5543866d21..10447c3c4ff1 100644
--- a/fs/ext4/page-io.c
+++ b/fs/ext4/page-io.c
@@ -226,8 +226,6 @@ static void ext4_add_complete_io(ext4_io_end_t *io_end)
 	unsigned long flags;

 	/* Only reserved conversions from writeback should enter here */
-	WARN_ON(!(io_end->flag & EXT4_IO_END_UNWRITTEN));
-	WARN_ON(!io_end->handle && sbi->s_journal);
 	spin_lock_irqsave(&ei->i_completed_io_lock, flags);
 	wq = sbi->rsv_conversion_wq;
 	if (list_empty(&ei->i_rsv_conversion_list))
@@ -252,7 +250,7 @@ static int ext4_do_flush_completed_IO(struct inode *inode,

 	while (!list_empty(&unwritten)) {
 		io_end = list_entry(unwritten.next, ext4_io_end_t, list);
-		BUG_ON(!(io_end->flag & EXT4_IO_END_UNWRITTEN));
+		BUG_ON(!(io_end->flag & (EXT4_IO_END_UNWRITTEN|EXT4_IO_UNCACHED)));
 		list_del_init(&io_end->list);

 		err = ext4_end_io_end(io_end);
@@ -287,14 +285,15 @@ ext4_io_end_t *ext4_init_io_end(struct inode *inode, gfp_t flags)

 void ext4_put_io_end_defer(ext4_io_end_t *io_end)
 {
-	if (refcount_dec_and_test(&io_end->count)) {
-		if (!(io_end->flag & EXT4_IO_END_UNWRITTEN) ||
-				list_empty(&io_end->list_vec)) {
-			ext4_release_io_end(io_end);
-			return;
-		}
-		ext4_add_complete_io(io_end);
+	if (!refcount_dec_and_test(&io_end->count))
+		return;
+	if ((!(io_end->flag & EXT4_IO_END_UNWRITTEN) ||
+	    list_empty(&io_end->list_vec)) &&
+	    !(io_end->flag & EXT4_IO_UNCACHED)) {
+		ext4_release_io_end(io_end);
+		return;
 	}
+	ext4_add_complete_io(io_end);
 }

 int ext4_put_io_end(ext4_io_end_t *io_end)
@@ -348,7 +347,7 @@ static void ext4_end_bio(struct bio *bio)
 				      blk_status_to_errno(bio->bi_status));
 	}

-	if (io_end->flag & EXT4_IO_END_UNWRITTEN) {
+	if (io_end->flag & (EXT4_IO_END_UNWRITTEN|EXT4_IO_UNCACHED)) {
 		/*
 		 * Link bio into list hanging from io_end. We have to do it
 		 * atomically as bio completions can be racing against each
@@ -417,8 +416,13 @@ static void io_submit_add_bh(struct ext4_io_submit *io,
 submit_and_retry:
 		ext4_io_submit(io);
 	}
-	if (io->io_bio == NULL)
+	if (io->io_bio == NULL) {
 		io_submit_init_bio(io, bh);
+		if (folio_test_uncached(folio)) {
+			ext4_io_end_t *io_end = io->io_bio->bi_private;
+			io_end->flag |= EXT4_IO_UNCACHED;
+		}
+	}
 	if (!bio_add_folio(io->io_bio, io_folio, bh->b_size, bh_offset(bh)))
 		goto submit_and_retry;
 	wbc_account_cgroup_owner(io->io_wbc, &folio->page, bh->b_size);
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
From: Jens Axboe
To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org
Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe
Subject: [PATCH 13/15] iomap: make buffered writes work with RWF_UNCACHED
Date: Sun, 10 Nov 2024 08:28:05 -0700
Message-ID: <20241110152906.1747545-14-axboe@kernel.dk>
X-Mailer: git-send-email 2.45.2
In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk>
References: <20241110152906.1747545-1-axboe@kernel.dk>
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

Add iomap buffered write support for RWF_UNCACHED. If RWF_UNCACHED is set
for a write, mark the folios being written as uncached. Then writeback
completion will drop the pages. The write_iter handler simply kicks off
writeback for the pages, and writeback completion will take care of the
rest.

This still needs the user of the iomap buffered write helpers to call
generic_uncached_write() upon successful issue of the writes.

Signed-off-by: Jens Axboe
---
 fs/iomap/buffered-io.c | 15 +++++++++++++--
 include/linux/iomap.h  |  4 +++-
 2 files changed, 16 insertions(+), 3 deletions(-)

diff --git a/fs/iomap/buffered-io.c b/fs/iomap/buffered-io.c
index ef0b68bccbb6..2f2a5db04a68 100644
--- a/fs/iomap/buffered-io.c
+++ b/fs/iomap/buffered-io.c
@@ -603,6 +603,8 @@ struct folio *iomap_get_folio(struct iomap_iter *iter, loff_t pos, size_t len)

 	if (iter->flags & IOMAP_NOWAIT)
 		fgp |= FGP_NOWAIT;
+	if (iter->flags & IOMAP_UNCACHED)
+		fgp |= FGP_UNCACHED;
 	fgp |= fgf_set_order(len);

 	return __filemap_get_folio(iter->inode->i_mapping, pos >> PAGE_SHIFT,
@@ -1023,8 +1025,9 @@ ssize_t
 iomap_file_buffered_write(struct kiocb *iocb, struct iov_iter *i,
 		const struct iomap_ops *ops, void *private)
 {
+	struct address_space *mapping = iocb->ki_filp->f_mapping;
 	struct iomap_iter iter = {
-		.inode		= iocb->ki_filp->f_mapping->host,
+		.inode		= mapping->host,
 		.pos		= iocb->ki_pos,
 		.len		= iov_iter_count(i),
 		.flags		= IOMAP_WRITE,
@@ -1034,9 +1037,14 @@ iomap_file_buffered_write(struct kiocb *iocb, struct iov_iter *i,

 	if (iocb->ki_flags & IOCB_NOWAIT)
 		iter.flags |= IOMAP_NOWAIT;
+	if (iocb->ki_flags & IOCB_UNCACHED)
+		iter.flags |= IOMAP_UNCACHED;

-	while ((ret = iomap_iter(&iter, ops)) > 0)
+	while ((ret = iomap_iter(&iter, ops)) > 0) {
+		if (iocb->ki_flags & IOCB_UNCACHED)
+			iter.iomap.flags |= IOMAP_F_UNCACHED;
 		iter.processed = iomap_write_iter(&iter, i);
+	}

 	if (unlikely(iter.pos == iocb->ki_pos))
 		return ret;
@@ -1770,6 +1778,9 @@ static int iomap_add_to_ioend(struct iomap_writepage_ctx *wpc,
 	size_t poff = offset_in_folio(folio, pos);
 	int error;

+	if (folio_test_uncached(folio))
+		wpc->iomap.flags |= IOMAP_F_UNCACHED;
+
 	if (!wpc->ioend || !iomap_can_add_to_ioend(wpc, pos)) {
 new_ioend:
 		error = iomap_submit_ioend(wpc, 0);
diff --git a/include/linux/iomap.h b/include/linux/iomap.h
index f61407e3b121..2efc72df19a2 100644
--- a/include/linux/iomap.h
+++ b/include/linux/iomap.h
@@ -64,6 +64,7 @@ struct vm_fault;
 #define IOMAP_F_BUFFER_HEAD	0
 #endif /* CONFIG_BUFFER_HEAD */
 #define IOMAP_F_XATTR		(1U << 5)
+#define IOMAP_F_UNCACHED	(1U << 6)

 /*
  * Flags set by the core iomap code during operations:
@@ -173,8 +174,9 @@ struct iomap_folio_ops {
 #define IOMAP_NOWAIT		(1 << 5) /* do not block */
 #define IOMAP_OVERWRITE_ONLY	(1 << 6) /* only pure overwrites allowed */
 #define IOMAP_UNSHARE		(1 << 7) /* unshare_file_range */
+#define IOMAP_UNCACHED		(1 << 8) /* uncached IO */
 #ifdef CONFIG_FS_DAX
-#define IOMAP_DAX		(1 << 8) /* DAX mapping */
+#define IOMAP_DAX		(1 << 9) /* DAX mapping */
 #else
 #define IOMAP_DAX		0
 #endif /* CONFIG_FS_DAX */
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
d=subspace.kernel.org; s=arc-20240116; t=1731252576; c=relaxed/simple; bh=JN6ziuaz3h3w93syhui0sSuROUWYBUO3x3xlg3f4CsA=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=pxVuwtKOAXvIKPgc06hrmAEDXSmVAyAaz7t5IB4p1XlBDCXye3iZ85+17t3Dt/4sjHb1utiTRHk6EVeiHmhmXv8dH16XoTs8d2ogwVB3G3WWHeuYzbm1DOkinPA716+acpyLmYeMNyJbm0KWgODH6oisQVKrNMxqC7w+TgUKbho= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk; spf=pass smtp.mailfrom=kernel.dk; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b=u8x6MYx/; arc=none smtp.client-ip=209.85.210.179 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=kernel.dk Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b="u8x6MYx/" Received: by mail-pf1-f179.google.com with SMTP id d2e1a72fcca58-723f37dd76cso3790018b3a.0 for ; Sun, 10 Nov 2024 07:29:34 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kernel-dk.20230601.gappssmtp.com; s=20230601; t=1731252574; x=1731857374; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=LL88Qd2tMlsmBvcuEyGfQPoq7EBSjG4mmD1oB3CTmL4=; b=u8x6MYx/UtKOMks7NJ+TLYm6DBTL4EOTB7sb6PmubdEYVNejDlMm2Eq76FR3nXWymp r3zVlFa6uMYain7P9bUHme/pWSDRtjdvIRZ5SMtKrWvTifsLiEgWVOJ9FOTpnL2CiGRy 7+j71tC8+NhT3Mx1+LDHq1Eh0iNEvGcTkJ1Kw1FQCPWqDGZhpknvWngSsX3eu91RDTeJ Ssh4TMi06dZ4R0X0frUPK79mIeN1IbzJElZIyguXJHhFqDbPyARWZ4mjBWIDG7/s6Tig aLBvrPBXUgKqT3se0RKHpC/2YT+XO9kdLOabVfuirvLl8LLFRMAAY5gqNp/VvQn7xS3o Gzeg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1731252574; x=1731857374; 
h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=LL88Qd2tMlsmBvcuEyGfQPoq7EBSjG4mmD1oB3CTmL4=; b=JBbGxfNG0xoWdAo0Y4KBi/gj7Qr6T1o9otkkRdZmotIlm/omZ+tO6Pt1L9CHFiH5ez osipcylnHVHwkoWwl1I/zY3nBb3d40y2BujZg8bn0mpqZU2B4Bjj90FnWhL5uX8swACt cEUKuYUFnNoDk7NsIriu6SHFPC2+0wqI5HFpiqLhd/V55Vm2RWuVyYdWaAWVndLeF3Dr Hp7EBA4QNe1uAEFeFgSO9LUzccWg4FxuCCZ1Nwc43QitOhEuNEgeGFfOBc5wkFMvj5Hy J70jXtO23Uh5iT2O3fLLZ+xfr9sZnqRtOhKRoC7NqowzyqUY3/xZT7HELHHTBPB376sv R8mw== X-Forwarded-Encrypted: i=1; AJvYcCXFv4aDFzFGE84/9BKMCYsGEFkmumVgYCAASkEvGmf2VmsrZiG/S2PiqXXGKc+LNIRWG+gCOkrwM7j3VLg=@vger.kernel.org X-Gm-Message-State: AOJu0YzZaUR0ucD1AZl4G4yXEhv54gkRUQdTcfHkILs88sRjiGy+cDY7 OHEfMVVjwXfC3aMELHky/K5cNgApUUfkqqO1/uhs7d+o3oBikIz24xy335tMdOg= X-Google-Smtp-Source: AGHT+IGDHhrWZvwz3+S2z66PgLKGWEEUzZeIBjMPJzYATyn0eBE8Y9PpfOEANie1fvuwvTvVxNqbng== X-Received: by 2002:a17:90b:38ce:b0:2e2:d15c:1a24 with SMTP id 98e67ed59e1d1-2e9b174113cmr12568464a91.23.1731252574323; Sun, 10 Nov 2024 07:29:34 -0800 (PST) Received: from localhost.localdomain ([198.8.77.157]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-2e99a5f935dsm9940973a91.35.2024.11.10.07.29.32 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 10 Nov 2024 07:29:33 -0800 (PST) From: Jens Axboe To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe Subject: [PATCH 14/15] xfs: punt uncached write completions to the completion wq Date: Sun, 10 Nov 2024 08:28:06 -0700 Message-ID: <20241110152906.1747545-15-axboe@kernel.dk> X-Mailer: git-send-email 2.45.2 In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk> References: <20241110152906.1747545-1-axboe@kernel.dk> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: 
quoted-printable
Content-Type: text/plain; charset="utf-8"

Uncached write completions need guaranteed non-irq context to be able to
prune ranges from the page cache. Treat them like unwritten extents and
punt them to the completion workqueue.

Signed-off-by: Jens Axboe
---
 fs/xfs/xfs_aops.c | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/fs/xfs/xfs_aops.c b/fs/xfs/xfs_aops.c
index 559a3a577097..c86fc2b8f344 100644
--- a/fs/xfs/xfs_aops.c
+++ b/fs/xfs/xfs_aops.c
@@ -416,9 +416,12 @@ xfs_prepare_ioend(
 
 	memalloc_nofs_restore(nofs_flag);
 
-	/* send ioends that might require a transaction to the completion wq */
+	/*
+	 * Send ioends that might require a transaction or need blocking
+	 * context to the completion wq
+	 */
 	if (xfs_ioend_is_append(ioend) || ioend->io_type == IOMAP_UNWRITTEN ||
-	    (ioend->io_flags & IOMAP_F_SHARED))
+	    (ioend->io_flags & (IOMAP_F_SHARED|IOMAP_F_UNCACHED)))
 		ioend->io_bio.bi_end_io = xfs_end_bio;
 	return status;
 }
-- 
2.45.2

From nobody Wed Dec 17 10:44:53 2025
Received: from mail-pj1-f53.google.com (mail-pj1-f53.google.com [209.85.216.53]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 617F617838C for ; Sun, 10 Nov 2024 15:29:36 +0000 (UTC)
Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.53
ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1731252577; cv=none; b=QfwwkgOX2eaeJMv8w473XM6eTiWsRQ8Aqn5Xj7ADbaBAjcWQ5ki1MMSL9Jt5VdmGb1Qm2vJ1qB5GrnXewueeaVX+BXS8vTQeXqQyWZ8aUWseKKdqZqTjHB+YXlTAnISZflNJIZxtRoxx2UNppZ+zSY/egi0bk8++3+BoMdakpdc=
ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1731252577; c=relaxed/simple; bh=cP6gr8wADb4EASrEnj9VSH3qpQrBlalS1qET7KiSVew=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; 
b=WB+wbSOqe6t+g1/Z3U39sj4N8X/nZczC2Nte4BZo4xwup+wePXvzHoWQWSYMYgA8HA8P4r9QGyOnszbETwSCRrz1Wf4QROS5+Rd8LPIbWXETzclIL9aKrc2gIAttJQ//7SReXL9YP1DX1lngbmRpbKuM9m+J1QIMB+Gv1UMhr2s= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk; spf=pass smtp.mailfrom=kernel.dk; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b=nNgGyGlD; arc=none smtp.client-ip=209.85.216.53 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=kernel.dk Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b="nNgGyGlD" Received: by mail-pj1-f53.google.com with SMTP id 98e67ed59e1d1-2e31af47681so3019342a91.2 for ; Sun, 10 Nov 2024 07:29:36 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kernel-dk.20230601.gappssmtp.com; s=20230601; t=1731252576; x=1731857376; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=ieWCO7ZvnQOESu0rOvKqnHS1oC/GZ0fSczrbrfzlBys=; b=nNgGyGlDhossD3IYkb7cZKYY6OTPoNH8VrLUz3yxK10gKKdjyEc06AeqtI+xzgtn9y asAtORaqqju5OGinAGYQT6QTzMzhZsfzGfhPtkcdO8LPVWc2y8I9eg3YfZCfb5rQbGlR mkCp1zVc1X3KE2JNODRI9ebbUz2QijrYD3qOBu37zA0KFAMiQalNd2sOI1sT7bgWMtAv k536RBeeAvvx9NGQeBeWob19O+jR9mNz0ckCjgjou+XfHhFTddN4vj+/DbuTobM+Tqtu /0HFT+Zkj3nV5Fw7lqZHUFqv6/cdZ3XLJav+YMeCGKWKbdj/If+WJ4KsPVqYrXVPveCb mu0g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1731252576; x=1731857376; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; 
bh=ieWCO7ZvnQOESu0rOvKqnHS1oC/GZ0fSczrbrfzlBys=; b=bVntO8E2zn4w6gYpZHsnjw8fSoO4wyUr6jAJbUZajlVGEOsvc+/HcVJrTokoD6svNK bH56AGHJmkNg6FLIT/WpgJE9EV8QergGBvpbro5v5wckIvt8kADZr+Z7cJTuIfjoYA6s OO7uRuAWrCI//w7qmTN5ipLIaCtAW0MboWKNpyMYkNVd3P9Mhep3X+xFVXtqCtx7wF7W ngWpcaShn2xxhWO/yEmExyD1oJXsnkDb/1dZ0rWDS/PRYvyIkqCoVdlw5PGzapQ+3/IH iAvP7h5D3IesRAVyqw8xanZOj7IYezo3JnPxNbU6nLnSW/IyQ+4Lw1rmv2c2FClrfFjH lRwg== X-Forwarded-Encrypted: i=1; AJvYcCVG8Qiah9cLKrbbrmM7S/ZvWMhdvW3MUm2LECfOMeyhmEZ1r4wJ2L6zj88Mu5eYQlnXNKdO3s6rKbudAbQ=@vger.kernel.org X-Gm-Message-State: AOJu0Yyb3HJRNCPOZJKSLlZSM2+VKTebsWCSd5jHqRmMMwLAegY281Zi 8nk1cBJ+cTPMexyfIzoaa2psPn5WAT2dgVu6qRRh9l0n+elZCVzwG9jAnhYUYBI= X-Google-Smtp-Source: AGHT+IHHwm5ckcwW6tzDCyvD/cFUv8RX7EzbVytEaqdSin9tImlfNCqPLYYgjj9DyXwqjBBAmsxOog== X-Received: by 2002:a17:90b:180a:b0:2e2:d434:854c with SMTP id 98e67ed59e1d1-2e9b1655951mr13339967a91.2.1731252575687; Sun, 10 Nov 2024 07:29:35 -0800 (PST) Received: from localhost.localdomain ([198.8.77.157]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-2e99a5f935dsm9940973a91.35.2024.11.10.07.29.34 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 10 Nov 2024 07:29:35 -0800 (PST) From: Jens Axboe To: linux-mm@kvack.org, linux-fsdevel@vger.kernel.org Cc: hannes@cmpxchg.org, clm@meta.com, linux-kernel@vger.kernel.org, willy@infradead.org, Jens Axboe Subject: [PATCH 15/15] xfs: flag as supporting FOP_UNCACHED Date: Sun, 10 Nov 2024 08:28:07 -0700 Message-ID: <20241110152906.1747545-16-axboe@kernel.dk> X-Mailer: git-send-email 2.45.2 In-Reply-To: <20241110152906.1747545-1-axboe@kernel.dk> References: <20241110152906.1747545-1-axboe@kernel.dk> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" Read side was already fully supported, for the write side all that's needed now is calling generic_uncached_write() when uncached writes 
have been submitted. With that, enable the use of RWF_UNCACHED with XFS by
flagging support with FOP_UNCACHED.

Signed-off-by: Jens Axboe
---
 fs/xfs/xfs_file.c | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/fs/xfs/xfs_file.c b/fs/xfs/xfs_file.c
index b19916b11fd5..1a7f46e13464 100644
--- a/fs/xfs/xfs_file.c
+++ b/fs/xfs/xfs_file.c
@@ -825,6 +825,7 @@ xfs_file_buffered_write(
 
 	if (ret > 0) {
 		XFS_STATS_ADD(ip->i_mount, xs_write_bytes, ret);
+		generic_uncached_write(iocb, ret);
 		/* Handle various SYNC-type writes */
 		ret = generic_write_sync(iocb, ret);
 	}
@@ -1595,7 +1596,8 @@ const struct file_operations xfs_file_operations = {
 	.fadvise	= xfs_file_fadvise,
 	.remap_file_range = xfs_file_remap_range,
 	.fop_flags	= FOP_MMAP_SYNC | FOP_BUFFER_RASYNC |
-			  FOP_BUFFER_WASYNC | FOP_DIO_PARALLEL_WRITE,
+			  FOP_BUFFER_WASYNC | FOP_DIO_PARALLEL_WRITE |
+			  FOP_UNCACHED,
 };
 
 const struct file_operations xfs_dir_file_operations = {
-- 
2.45.2