From nobody Sun Dec 14 12:16:12 2025
From: Andrea Righi
To: Tejun Heo , David Vernet , Changwoo Min
Cc: linux-kernel@vger.kernel.org
Subject: [PATCH 1/2] sched_ext: Track currently locked rq
Date: Tue, 22 Apr 2025 10:26:32 +0200
Message-ID: <20250422082907.110167-2-arighi@nvidia.com>
X-Mailer: git-send-email 2.49.0
In-Reply-To: <20250422082907.110167-1-arighi@nvidia.com>
References: <20250422082907.110167-1-arighi@nvidia.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"

Some kfuncs provided by sched_ext may need to operate on a struct rq,
but they can be invoked from various contexts, specifically different
scx callbacks.

While some of these callbacks are invoked with a particular rq already
locked, others are not. This makes it impossible for a kfunc to reliably
determine whether it's safe to access a given rq, triggering potential
bugs or unsafe behaviors, see for example [1].

To address this, track the currently locked rq whenever a sched_ext
callback is invoked via SCX_CALL_OP*().

This allows kfuncs that need to operate on an arbitrary rq to retrieve
the currently locked one and apply the appropriate action as needed.

[1] https://lore.kernel.org/lkml/20250325140021.73570-1-arighi@nvidia.com/

Suggested-by: Tejun Heo
Signed-off-by: Andrea Righi
---
 kernel/sched/ext.c      | 152 +++++++++++++++++++++++++---------------
 kernel/sched/ext_idle.c |   2 +-
 2 files changed, 95 insertions(+), 59 deletions(-)

diff --git a/kernel/sched/ext.c b/kernel/sched/ext.c
index bb0873411d798..3365b447cbdb8 100644
--- a/kernel/sched/ext.c
+++ b/kernel/sched/ext.c
@@ -1116,8 +1116,38 @@ static void scx_kf_disallow(u32 mask)
 	current->scx.kf_mask &= ~mask;
 }
 
-#define SCX_CALL_OP(mask, op, args...)					\
+/*
+ * Track the rq currently locked.
+ *
+ * This allows kfuncs to safely operate on rq from any scx ops callback,
+ * knowing which rq is already locked.
+ */
+static DEFINE_PER_CPU(struct rq *, locked_rq);
+
+static inline void update_locked_rq(struct rq *rq)
+{
+	/*
+	 * Check whether @rq is actually locked. This can help expose bugs
+	 * or incorrect assumptions about the context in which a kfunc or
+	 * callback is executed.
+	 */
+	if (rq)
+		lockdep_assert_rq_held(rq);
+	__this_cpu_write(locked_rq, rq);
+}
+
+/*
+ * Return the rq currently locked from an scx callback, or NULL if no rq is
+ * locked.
+ */
+static inline struct rq *scx_locked_rq(void)
+{
+	return __this_cpu_read(locked_rq);
+}
+
+#define SCX_CALL_OP(mask, op, rq, args...)				\
 do {									\
+	update_locked_rq(rq);						\
 	if (mask) {							\
 		scx_kf_allow(mask);					\
 		scx_ops.op(args);					\
@@ -1125,11 +1155,14 @@ do {									\
 	} else {							\
 		scx_ops.op(args);					\
 	}								\
+	update_locked_rq(NULL);						\
 } while (0)
 
-#define SCX_CALL_OP_RET(mask, op, args...)				\
+#define SCX_CALL_OP_RET(mask, op, rq, args...)				\
 ({									\
 	__typeof__(scx_ops.op(args)) __ret;				\
+									\
+	update_locked_rq(rq);						\
 	if (mask) {							\
 		scx_kf_allow(mask);					\
 		__ret = scx_ops.op(args);				\
@@ -1137,6 +1170,7 @@ do {									\
 	} else {							\
 		__ret = scx_ops.op(args);				\
 	}								\
+	update_locked_rq(NULL);						\
 	__ret;								\
 })
 
@@ -1151,31 +1185,31 @@ do {									\
  * scx_kf_allowed_on_arg_tasks() to test whether the invocation is allowed on
  * the specific task.
  */
-#define SCX_CALL_OP_TASK(mask, op, task, args...)			\
+#define SCX_CALL_OP_TASK(mask, op, rq, task, args...)			\
 do {									\
 	BUILD_BUG_ON((mask) & ~__SCX_KF_TERMINAL);			\
 	current->scx.kf_tasks[0] = task;				\
-	SCX_CALL_OP(mask, op, task, ##args);				\
+	SCX_CALL_OP(mask, op, rq, task, ##args);			\
 	current->scx.kf_tasks[0] = NULL;				\
 } while (0)
 
-#define SCX_CALL_OP_TASK_RET(mask, op, task, args...)			\
+#define SCX_CALL_OP_TASK_RET(mask, op, rq, task, args...)		\
 ({									\
 	__typeof__(scx_ops.op(task, ##args)) __ret;			\
 	BUILD_BUG_ON((mask) & ~__SCX_KF_TERMINAL);			\
 	current->scx.kf_tasks[0] = task;				\
-	__ret = SCX_CALL_OP_RET(mask, op, task, ##args);		\
+	__ret = SCX_CALL_OP_RET(mask, op, rq, task, ##args);		\
 	current->scx.kf_tasks[0] = NULL;				\
 	__ret;								\
 })
 
-#define SCX_CALL_OP_2TASKS_RET(mask, op, task0, task1, args...)	\
+#define SCX_CALL_OP_2TASKS_RET(mask, op, rq, task0, task1, args...)	\
 ({									\
 	__typeof__(scx_ops.op(task0, task1, ##args)) __ret;		\
 	BUILD_BUG_ON((mask) & ~__SCX_KF_TERMINAL);			\
 	current->scx.kf_tasks[0] = task0;				\
 	current->scx.kf_tasks[1] = task1;				\
-	__ret = SCX_CALL_OP_RET(mask, op, task0, task1, ##args);	\
+	__ret = SCX_CALL_OP_RET(mask, op, rq, task0, task1, ##args);	\
 	current->scx.kf_tasks[0] = NULL;				\
 	current->scx.kf_tasks[1] = NULL;				\
 	__ret;								\
@@ -2174,7 +2208,7 @@ static void do_enqueue_task(struct rq *rq, struct task_struct *p, u64 enq_flags,
 	WARN_ON_ONCE(*ddsp_taskp);
 	*ddsp_taskp = p;
 
-	SCX_CALL_OP_TASK(SCX_KF_ENQUEUE, enqueue, p, enq_flags);
+	SCX_CALL_OP_TASK(SCX_KF_ENQUEUE, enqueue, rq, p, enq_flags);
 
 	*ddsp_taskp = NULL;
 	if (p->scx.ddsp_dsq_id != SCX_DSQ_INVALID)
@@ -2269,7 +2303,7 @@ static void enqueue_task_scx(struct rq *rq, struct task_struct *p, int enq_flags
 	add_nr_running(rq, 1);
 
 	if (SCX_HAS_OP(runnable) && !task_on_rq_migrating(p))
-		SCX_CALL_OP_TASK(SCX_KF_REST, runnable, p, enq_flags);
+		SCX_CALL_OP_TASK(SCX_KF_REST, runnable, rq, p, enq_flags);
 
 	if (enq_flags & SCX_ENQ_WAKEUP)
 		touch_core_sched(rq, p);
@@ -2283,7 +2317,7 @@ static void enqueue_task_scx(struct rq *rq, struct task_struct *p, int enq_flags
 		__scx_add_event(SCX_EV_SELECT_CPU_FALLBACK, 1);
 }
 
-static void ops_dequeue(struct task_struct *p, u64 deq_flags)
+static void ops_dequeue(struct rq *rq, struct task_struct *p, u64 deq_flags)
 {
 	unsigned long opss;
 
@@ -2304,7 +2338,7 @@ static void ops_dequeue(struct task_struct *p, u64 deq_flags)
 		BUG();
 	case SCX_OPSS_QUEUED:
 		if (SCX_HAS_OP(dequeue))
-			SCX_CALL_OP_TASK(SCX_KF_REST, dequeue, p, deq_flags);
+			SCX_CALL_OP_TASK(SCX_KF_REST, dequeue, rq, p, deq_flags);
 
 		if (atomic_long_try_cmpxchg(&p->scx.ops_state, &opss,
 					    SCX_OPSS_NONE))
@@ -2337,7 +2371,7 @@ static bool dequeue_task_scx(struct rq *rq, struct task_struct *p, int deq_flags
 		return true;
 	}
 
-	ops_dequeue(p, deq_flags);
+	ops_dequeue(rq, p, deq_flags);
 
 	/*
 	 * A currently running task which is going off @rq first gets dequeued
@@ -2353,11 +2387,11 @@ static bool dequeue_task_scx(struct rq *rq, struct task_struct *p, int deq_flags
 	 */
 	if (SCX_HAS_OP(stopping) && task_current(rq, p)) {
 		update_curr_scx(rq);
-		SCX_CALL_OP_TASK(SCX_KF_REST, stopping, p, false);
+		SCX_CALL_OP_TASK(SCX_KF_REST, stopping, rq, p, false);
 	}
 
 	if (SCX_HAS_OP(quiescent) && !task_on_rq_migrating(p))
-		SCX_CALL_OP_TASK(SCX_KF_REST, quiescent, p, deq_flags);
+		SCX_CALL_OP_TASK(SCX_KF_REST, quiescent, rq, p, deq_flags);
 
 	if (deq_flags & SCX_DEQ_SLEEP)
 		p->scx.flags |= SCX_TASK_DEQD_FOR_SLEEP;
@@ -2377,7 +2411,7 @@ static void yield_task_scx(struct rq *rq)
 	struct task_struct *p = rq->curr;
 
 	if (SCX_HAS_OP(yield))
-		SCX_CALL_OP_2TASKS_RET(SCX_KF_REST, yield, p, NULL);
+		SCX_CALL_OP_2TASKS_RET(SCX_KF_REST, yield, rq, p, NULL);
 	else
 		p->scx.slice = 0;
 }
@@ -2387,7 +2421,7 @@ static bool yield_to_task_scx(struct rq *rq, struct task_struct *to)
 	struct task_struct *from = rq->curr;
 
 	if (SCX_HAS_OP(yield))
-		return SCX_CALL_OP_2TASKS_RET(SCX_KF_REST, yield, from, to);
+		return SCX_CALL_OP_2TASKS_RET(SCX_KF_REST, yield, rq, from, to);
 	else
 		return false;
 }
@@ -2945,7 +2979,7 @@ static int balance_one(struct rq *rq, struct task_struct *prev)
 	 * emitted in switch_class().
 	 */
 	if (SCX_HAS_OP(cpu_acquire))
-		SCX_CALL_OP(SCX_KF_REST, cpu_acquire, cpu_of(rq), NULL);
+		SCX_CALL_OP(SCX_KF_REST, cpu_acquire, rq, cpu_of(rq), NULL);
 	rq->scx.cpu_released = false;
 }
 
@@ -2990,7 +3024,7 @@ static int balance_one(struct rq *rq, struct task_struct *prev)
 	do {
 		dspc->nr_tasks = 0;
 
-		SCX_CALL_OP(SCX_KF_DISPATCH, dispatch, cpu_of(rq),
+		SCX_CALL_OP(SCX_KF_DISPATCH, dispatch, rq, cpu_of(rq),
 			    prev_on_scx ? prev : NULL);
 
 		flush_dispatch_buf(rq);
@@ -3104,7 +3138,7 @@ static void set_next_task_scx(struct rq *rq, struct task_struct *p, bool first)
 		 * Core-sched might decide to execute @p before it is
		 * dispatched. Call ops_dequeue() to notify the BPF scheduler.
		 */
-		ops_dequeue(p, SCX_DEQ_CORE_SCHED_EXEC);
+		ops_dequeue(rq, p, SCX_DEQ_CORE_SCHED_EXEC);
 		dispatch_dequeue(rq, p);
 	}
 
@@ -3112,7 +3146,7 @@ static void set_next_task_scx(struct rq *rq, struct task_struct *p, bool first)
 
 	/* see dequeue_task_scx() on why we skip when !QUEUED */
 	if (SCX_HAS_OP(running) && (p->scx.flags & SCX_TASK_QUEUED))
-		SCX_CALL_OP_TASK(SCX_KF_REST, running, p);
+		SCX_CALL_OP_TASK(SCX_KF_REST, running, rq, p);
 
 	clr_task_runnable(p, true);
 
@@ -3193,8 +3227,7 @@ static void switch_class(struct rq *rq, struct task_struct *next)
 			.task = next,
 		};
 
-		SCX_CALL_OP(SCX_KF_CPU_RELEASE,
-			    cpu_release, cpu_of(rq), &args);
+		SCX_CALL_OP(SCX_KF_CPU_RELEASE, cpu_release, rq, cpu_of(rq), &args);
 	}
 	rq->scx.cpu_released = true;
 }
@@ -3207,7 +3240,7 @@ static void put_prev_task_scx(struct rq *rq, struct task_struct *p,
 
 	/* see dequeue_task_scx() on why we skip when !QUEUED */
 	if (SCX_HAS_OP(stopping) && (p->scx.flags & SCX_TASK_QUEUED))
-		SCX_CALL_OP_TASK(SCX_KF_REST, stopping, p, true);
+		SCX_CALL_OP_TASK(SCX_KF_REST, stopping, rq, p, true);
 
 	if (p->scx.flags & SCX_TASK_QUEUED) {
 		set_task_runnable(rq, p);
@@ -3345,7 +3378,7 @@ bool scx_prio_less(const struct task_struct *a, const struct task_struct *b,
 	 * verifier.
 	 */
 	if (SCX_HAS_OP(core_sched_before) && !scx_rq_bypassing(task_rq(a)))
-		return SCX_CALL_OP_2TASKS_RET(SCX_KF_REST, core_sched_before,
+		return SCX_CALL_OP_2TASKS_RET(SCX_KF_REST, core_sched_before, NULL,
					      (struct task_struct *)a,
					      (struct task_struct *)b);
 	else
@@ -3382,7 +3415,7 @@ static int select_task_rq_scx(struct task_struct *p, int prev_cpu, int wake_flag
 	*ddsp_taskp = p;
 
 	cpu = SCX_CALL_OP_TASK_RET(SCX_KF_ENQUEUE | SCX_KF_SELECT_CPU,
-				   select_cpu, p, prev_cpu, wake_flags);
+				   select_cpu, NULL, p, prev_cpu, wake_flags);
 	p->scx.selected_cpu = cpu;
 	*ddsp_taskp = NULL;
 	if (ops_cpu_valid(cpu, "from ops.select_cpu()"))
@@ -3426,8 +3459,8 @@ static void set_cpus_allowed_scx(struct task_struct *p,
 	 * designation pointless. Cast it away when calling the operation.
 	 */
 	if (SCX_HAS_OP(set_cpumask))
-		SCX_CALL_OP_TASK(SCX_KF_REST, set_cpumask, p,
-				 (struct cpumask *)p->cpus_ptr);
+		SCX_CALL_OP_TASK(SCX_KF_REST, set_cpumask, NULL,
+				 p, (struct cpumask *)p->cpus_ptr);
 }
 
 static void handle_hotplug(struct rq *rq, bool online)
@@ -3440,9 +3473,9 @@ static void handle_hotplug(struct rq *rq, bool online)
 	scx_idle_update_selcpu_topology(&scx_ops);
 
 	if (online && SCX_HAS_OP(cpu_online))
-		SCX_CALL_OP(SCX_KF_UNLOCKED, cpu_online, cpu);
+		SCX_CALL_OP(SCX_KF_UNLOCKED, cpu_online, rq, cpu);
 	else if (!online && SCX_HAS_OP(cpu_offline))
-		SCX_CALL_OP(SCX_KF_UNLOCKED, cpu_offline, cpu);
+		SCX_CALL_OP(SCX_KF_UNLOCKED, cpu_offline, rq, cpu);
 	else
 		scx_exit(SCX_ECODE_ACT_RESTART | SCX_ECODE_RSN_HOTPLUG,
 			 "cpu %d going %s, exiting scheduler", cpu,
@@ -3545,7 +3578,7 @@ static void task_tick_scx(struct rq *rq, struct task_struct *curr, int queued)
 		curr->scx.slice = 0;
 		touch_core_sched(rq, curr);
 	} else if (SCX_HAS_OP(tick)) {
-		SCX_CALL_OP_TASK(SCX_KF_REST, tick, curr);
+		SCX_CALL_OP_TASK(SCX_KF_REST, tick, rq, curr);
 	}
 
 	if (!curr->scx.slice)
@@ -3622,7 +3655,7 @@ static int scx_init_task(struct task_struct *p, struct task_group *tg, bool fork
 		.fork = fork,
 	};
 
-	ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, init_task, p, &args);
+	ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, init_task, NULL, p, &args);
 	if (unlikely(ret)) {
 		ret = ops_sanitize_err("init_task", ret);
 		return ret;
@@ -3663,9 +3696,10 @@ static int scx_init_task(struct task_struct *p, struct task_group *tg, bool fork
 
 static void scx_enable_task(struct task_struct *p)
 {
+	struct rq *rq = task_rq(p);
 	u32 weight;
 
-	lockdep_assert_rq_held(task_rq(p));
+	lockdep_assert_rq_held(rq);
 
 	/*
 	 * Set the weight before calling ops.enable() so that the scheduler
@@ -3679,20 +3713,22 @@ static void scx_enable_task(struct task_struct *p)
 	p->scx.weight = sched_weight_to_cgroup(weight);
 
 	if (SCX_HAS_OP(enable))
-		SCX_CALL_OP_TASK(SCX_KF_REST, enable, p);
+		SCX_CALL_OP_TASK(SCX_KF_REST, enable, rq, p);
 	scx_set_task_state(p, SCX_TASK_ENABLED);
 
 	if (SCX_HAS_OP(set_weight))
-		SCX_CALL_OP_TASK(SCX_KF_REST, set_weight, p, p->scx.weight);
+		SCX_CALL_OP_TASK(SCX_KF_REST, set_weight, rq, p, p->scx.weight);
 }
 
 static void scx_disable_task(struct task_struct *p)
 {
-	lockdep_assert_rq_held(task_rq(p));
+	struct rq *rq = task_rq(p);
+
+	lockdep_assert_rq_held(rq);
 	WARN_ON_ONCE(scx_get_task_state(p) != SCX_TASK_ENABLED);
 
 	if (SCX_HAS_OP(disable))
-		SCX_CALL_OP_TASK(SCX_KF_REST, disable, p);
+		SCX_CALL_OP_TASK(SCX_KF_REST, disable, rq, p);
 	scx_set_task_state(p, SCX_TASK_READY);
 }
 
@@ -3721,7 +3757,7 @@ static void scx_exit_task(struct task_struct *p)
 	}
 
 	if (SCX_HAS_OP(exit_task))
-		SCX_CALL_OP_TASK(SCX_KF_REST, exit_task, p, &args);
+		SCX_CALL_OP_TASK(SCX_KF_REST, exit_task, task_rq(p), p, &args);
 	scx_set_task_state(p, SCX_TASK_NONE);
 }
 
@@ -3830,7 +3866,7 @@ static void reweight_task_scx(struct rq *rq, struct task_struct *p,
 
 	p->scx.weight = sched_weight_to_cgroup(scale_load_down(lw->weight));
 	if (SCX_HAS_OP(set_weight))
-		SCX_CALL_OP_TASK(SCX_KF_REST, set_weight, p, p->scx.weight);
+		SCX_CALL_OP_TASK(SCX_KF_REST, set_weight, rq, p, p->scx.weight);
 }
 
 static void prio_changed_scx(struct rq *rq, struct task_struct *p, int oldprio)
@@ -3846,8 +3882,8 @@ static void switching_to_scx(struct rq *rq, struct task_struct *p)
 	 * different scheduler class. Keep the BPF scheduler up-to-date.
 	 */
 	if (SCX_HAS_OP(set_cpumask))
-		SCX_CALL_OP_TASK(SCX_KF_REST, set_cpumask, p,
-				 (struct cpumask *)p->cpus_ptr);
+		SCX_CALL_OP_TASK(SCX_KF_REST, set_cpumask, rq,
+				 p, (struct cpumask *)p->cpus_ptr);
 }
 
 static void switched_from_scx(struct rq *rq, struct task_struct *p)
@@ -3908,7 +3944,7 @@ int scx_tg_online(struct task_group *tg)
 		struct scx_cgroup_init_args args = { .weight = tg->scx_weight };
 
-		ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, cgroup_init,
+		ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, cgroup_init, NULL,
				      tg->css.cgroup, &args);
 		if (ret)
 			ret = ops_sanitize_err("cgroup_init", ret);
@@ -3930,7 +3966,7 @@ void scx_tg_offline(struct task_group *tg)
 	percpu_down_read(&scx_cgroup_rwsem);
 
 	if (SCX_HAS_OP(cgroup_exit) && (tg->scx_flags & SCX_TG_INITED))
-		SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_exit, tg->css.cgroup);
+		SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_exit, NULL, tg->css.cgroup);
 	tg->scx_flags &= ~(SCX_TG_ONLINE | SCX_TG_INITED);
 
 	percpu_up_read(&scx_cgroup_rwsem);
@@ -3963,7 +3999,7 @@ int scx_cgroup_can_attach(struct cgroup_taskset *tset)
 			continue;
 
 		if (SCX_HAS_OP(cgroup_prep_move)) {
-			ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, cgroup_prep_move,
+			ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, cgroup_prep_move, NULL,
					      p, from, css->cgroup);
			if (ret)
				goto err;
@@ -3977,8 +4013,8 @@ int scx_cgroup_can_attach(struct cgroup_taskset *tset)
 err:
 	cgroup_taskset_for_each(p, css, tset) {
 		if (SCX_HAS_OP(cgroup_cancel_move) && p->scx.cgrp_moving_from)
-			SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_cancel_move, p,
-				    p->scx.cgrp_moving_from, css->cgroup);
+			SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_cancel_move, NULL,
+				    p, p->scx.cgrp_moving_from, css->cgroup);
 		p->scx.cgrp_moving_from = NULL;
 	}
 
@@ -3996,8 +4032,8 @@ void scx_cgroup_move_task(struct task_struct *p)
 	 * cgrp_moving_from set.
 	 */
 	if (SCX_HAS_OP(cgroup_move) && !WARN_ON_ONCE(!p->scx.cgrp_moving_from))
-		SCX_CALL_OP_TASK(SCX_KF_UNLOCKED, cgroup_move, p,
-			p->scx.cgrp_moving_from, tg_cgrp(task_group(p)));
+		SCX_CALL_OP_TASK(SCX_KF_UNLOCKED, cgroup_move, NULL,
+			p, p->scx.cgrp_moving_from, tg_cgrp(task_group(p)));
 	p->scx.cgrp_moving_from = NULL;
 }
 
@@ -4016,8 +4052,8 @@ void scx_cgroup_cancel_attach(struct cgroup_taskset *tset)
 
 	cgroup_taskset_for_each(p, css, tset) {
 		if (SCX_HAS_OP(cgroup_cancel_move) && p->scx.cgrp_moving_from)
-			SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_cancel_move, p,
-				    p->scx.cgrp_moving_from, css->cgroup);
+			SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_cancel_move, NULL,
+				    p, p->scx.cgrp_moving_from, css->cgroup);
 		p->scx.cgrp_moving_from = NULL;
 	}
 out_unlock:
@@ -4030,7 +4066,7 @@ void scx_group_set_weight(struct task_group *tg, unsigned long weight)
 
 	if (scx_cgroup_enabled && tg->scx_weight != weight) {
 		if (SCX_HAS_OP(cgroup_set_weight))
-			SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_set_weight,
+			SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_set_weight, NULL,
 				    tg_cgrp(tg), weight);
 		tg->scx_weight = weight;
 	}
@@ -4219,7 +4255,7 @@ static void scx_cgroup_exit(void)
 			continue;
 		rcu_read_unlock();
 
-		SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_exit, css->cgroup);
+		SCX_CALL_OP(SCX_KF_UNLOCKED, cgroup_exit, NULL, css->cgroup);
 
 		rcu_read_lock();
 		css_put(css);
@@ -4256,7 +4292,7 @@ static int scx_cgroup_init(void)
 			continue;
 		rcu_read_unlock();
 
-		ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, cgroup_init,
+		ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, cgroup_init, NULL,
 				      css->cgroup, &args);
 		if (ret) {
 			css_put(css);
@@ -4749,7 +4785,7 @@ static void scx_disable_workfn(struct kthread_work *work)
 	}
 
 	if (scx_ops.exit)
-		SCX_CALL_OP(SCX_KF_UNLOCKED, exit, ei);
+		SCX_CALL_OP(SCX_KF_UNLOCKED, exit, NULL, ei);
 
 	cancel_delayed_work_sync(&scx_watchdog_work);
 
@@ -4955,7 +4991,7 @@ static void scx_dump_task(struct seq_buf *s, struct scx_dump_ctx *dctx,
 
 	if (SCX_HAS_OP(dump_task)) {
 		ops_dump_init(s, "    ");
-		SCX_CALL_OP(SCX_KF_REST, dump_task, dctx, p);
+		SCX_CALL_OP(SCX_KF_REST, dump_task, NULL, dctx, p);
 		ops_dump_exit();
 	}
 
@@ -5002,7 +5038,7 @@ static void scx_dump_state(struct scx_exit_info *ei, size_t dump_len)
 
 	if (SCX_HAS_OP(dump)) {
 		ops_dump_init(&s, "");
-		SCX_CALL_OP(SCX_KF_UNLOCKED, dump, &dctx);
+		SCX_CALL_OP(SCX_KF_UNLOCKED, dump, NULL, &dctx);
 		ops_dump_exit();
 	}
 
@@ -5059,7 +5095,7 @@ static void scx_dump_state(struct scx_exit_info *ei, size_t dump_len)
 		used = seq_buf_used(&ns);
 		if (SCX_HAS_OP(dump_cpu)) {
 			ops_dump_init(&ns, "  ");
-			SCX_CALL_OP(SCX_KF_REST, dump_cpu, &dctx, cpu, idle);
+			SCX_CALL_OP(SCX_KF_REST, dump_cpu, NULL, &dctx, cpu, idle);
 			ops_dump_exit();
 		}
 
@@ -5315,7 +5351,7 @@ static int scx_enable(struct sched_ext_ops *ops, struct bpf_link *link)
 	scx_idle_enable(ops);
 
 	if (scx_ops.init) {
-		ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, init);
+		ret = SCX_CALL_OP_RET(SCX_KF_UNLOCKED, init, NULL);
 		if (ret) {
 			ret = ops_sanitize_err("init", ret);
 			cpus_read_unlock();
diff --git a/kernel/sched/ext_idle.c b/kernel/sched/ext_idle.c
index 023ae6df5e8ca..35aa309c95846 100644
--- a/kernel/sched/ext_idle.c
+++ b/kernel/sched/ext_idle.c
@@ -745,7 +745,7 @@ void __scx_update_idle(struct rq *rq, bool idle, bool do_notify)
 	 * managed by put_prev_task_idle()/set_next_task_idle().
 	 */
 	if (SCX_HAS_OP(update_idle) && do_notify && !scx_rq_bypassing(rq))
-		SCX_CALL_OP(SCX_KF_REST, update_idle, cpu_of(rq), idle);
+		SCX_CALL_OP(SCX_KF_REST, update_idle, rq, cpu_of(rq), idle);
 
 	/*
 	 * Update the idle masks:
-- 
2.49.0

From nobody Sun Dec 14 12:16:12 2025
ARC-Seal: i=1; a=rsa-sha256; s=arcselector10001; d=microsoft.com; cv=none; b=CJMcayXSaMUFDCvo4yNV4kJJFQ8gg5IivuaX7CzGjgP8HVgnBFILp6bXO2OQmDwDSPGOEVMNyck1yf636JMFXrYLdC0tDDQBqRfbWXsT7hDfXA3MySnQAAQgYwM0G0K8cDrFk7GZD0oozLJqbzLK1KhSg7zfpwFRz1BaCACpefGz5n5vW1EatcYEqXWy4C5o7PTZHAVgspfM0Vel1ZC+Sg7jc/4yxDdsOaDa+wFc9AHZ8++De+d+zALEG3BGd31YMeONjkgTuaT05nL02pMJj2JHDGjtd5Sb+eu/DeQpJvZ9ahJllUWf4dTF76OnQE+b2yVyh4WSKKB5HiP6Pp/e6Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector10001; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=KDewoeyhtM0vh1bAUrwWUhNqF95n7XYvQFodDlu+LrY=; b=iDhxfq2vdXwT+46uV0V9i65LyzT7THENA2mjnmKN7szhRFktaK4gt5EqU1u08cLPXxrB78qcwz6RZdqEOWYwv1l9DwEQyd4SdoxY3hHGlJx9Ja5/AT/Qd5WNLbzYYwnm/EipQktadxki1mANjjUx6zwvNI+VnnivX2s3gl5twEcMRDWBEfl/JtxRgUbG1CkR8lfw47Bd7A1JNmxQ4kNxdMB9+tIkLIKEJ7SDS4nAaAqDO1vruyMCKuoBA2Q2Teo27NvPs0peLzw5A1TysnRAyADi/duMT5+Cf68s/LMtLH/C+qjPrngODFqtzqMyjyJZN34hxEAoFP5dODqNbTJ1nQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nvidia.com; dmarc=pass action=none header.from=nvidia.com; dkim=pass header.d=nvidia.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=Nvidia.com; s=selector2; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=KDewoeyhtM0vh1bAUrwWUhNqF95n7XYvQFodDlu+LrY=; b=L/cVaDQ0dJpOkx/p8GIP0czwTDcrcbRQOAfkbTMot9BScc7WiTC09QJEd21qw/I4+iVzsT7I5bI/PA7YKv3UTIoY0f6LIi7Pka/YlcFBpbqNed3t9bhBw5+f1u685KhPgecmMW22Ir8LDkDHjQpGw8GzrLRQO/eCs8uXxSzxdaEjVwAcyDiCrhSLO2o09CuvNsSLc4JgdGmVy6P3KXuR58eK7z7mEo8qhFGoJDE5Xde+6Hw+sCtjs+0vm6Wp+5X/1kmC8fnXRvW6sNH0t702nYOabE5pzb4YodVFKxypnxc6drrmFQdng5OCP+qCl5PRdT7BxwxZl2OAZ9TSbAmouQ== Authentication-Results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=nvidia.com; Received: from 
From: Andrea Righi <arighi@nvidia.com>
To: Tejun Heo, David Vernet, Changwoo Min
Cc: linux-kernel@vger.kernel.org
Subject: [PATCH 2/2] sched_ext: Fix missing rq lock in scx_bpf_cpuperf_set()
Date: Tue, 22 Apr 2025 10:26:33 +0200
Message-ID: <20250422082907.110167-3-arighi@nvidia.com>
X-Mailer: git-send-email 2.49.0
In-Reply-To: <20250422082907.110167-1-arighi@nvidia.com>
References: <20250422082907.110167-1-arighi@nvidia.com>
MIME-Version: 1.0
scx_bpf_cpuperf_set() can be used to set a performance target level on
any CPU. However, it doesn't correctly acquire the corresponding rq
lock, which may lead to unsafe behavior and trigger the following
warning, due to the lockdep_assert_rq_held() check:

[   51.713737] WARNING: CPU: 3 PID: 3899 at kernel/sched/sched.h:1512 scx_bpf_cpuperf_set+0x1a0/0x1e0
...
[   51.713836] Call trace:
[   51.713837]  scx_bpf_cpuperf_set+0x1a0/0x1e0 (P)
[   51.713839]  bpf_prog_62d35beb9301601f_bpfland_init+0x168/0x440
[   51.713841]  bpf__sched_ext_ops_init+0x54/0x8c
[   51.713843]  scx_ops_enable.constprop.0+0x2c0/0x10f0
[   51.713845]  bpf_scx_reg+0x18/0x30
[   51.713847]  bpf_struct_ops_link_create+0x154/0x1b0
[   51.713849]  __sys_bpf+0x1934/0x22a0

Fix by properly acquiring the rq lock when possible, or by raising an
error when the target CPU is not the one whose rq lock is currently
held.
Fixes: d86adb4fc0655 ("sched_ext: Add cpuperf support")
Signed-off-by: Andrea Righi <arighi@nvidia.com>
---
 kernel/sched/ext.c | 27 +++++++++++++++++++++++----
 1 file changed, 23 insertions(+), 4 deletions(-)

diff --git a/kernel/sched/ext.c b/kernel/sched/ext.c
index 3365b447cbdb8..a175b622716ce 100644
--- a/kernel/sched/ext.c
+++ b/kernel/sched/ext.c
@@ -7088,13 +7088,32 @@ __bpf_kfunc void scx_bpf_cpuperf_set(s32 cpu, u32 perf)
 	}
 
 	if (ops_cpu_valid(cpu, NULL)) {
-		struct rq *rq = cpu_rq(cpu);
+		struct rq *rq = cpu_rq(cpu), *locked_rq = scx_locked_rq();
+		struct rq_flags rf;
+
+		/*
+		 * When called with an rq lock held, restrict the operation
+		 * to the corresponding CPU to prevent ABBA deadlocks.
+		 */
+		if (locked_rq && rq != locked_rq) {
+			scx_error("Invalid target CPU %d", cpu);
+			return;
+		}
+
+		/*
+		 * If no rq lock is held, allow to operate on any CPU by
+		 * acquiring the corresponding rq lock.
+		 */
+		if (!locked_rq) {
+			rq_lock_irqsave(rq, &rf);
+			update_rq_clock(rq);
+		}
 
 		rq->scx.cpuperf_target = perf;
+		cpufreq_update_util(rq, 0);
 
-		rcu_read_lock_sched_notrace();
-		cpufreq_update_util(cpu_rq(cpu), 0);
-		rcu_read_unlock_sched_notrace();
+		if (!locked_rq)
+			rq_unlock_irqrestore(rq, &rf);
 	}
 }
 
-- 
2.49.0