From: Lizhi Hou
Subject: [PATCH V1] accel/amdxdna: Enhance runtime power management
Date: Thu, 18 Sep 2025 10:24:39 -0700
Message-ID: <20250918172439.428139-1-lizhi.hou@amd.com>
X-Mailing-List: linux-kernel@vger.kernel.org

Currently, pm_runtime_resume_and_get() is invoked in the driver's open
callback, and pm_runtime_put_autosuspend() is called in the close
callback. As a result, the device remains active whenever an application
opens it, even if no I/O is performed, leading to unnecessary power
consumption.

Move the runtime PM calls to the AIE2 callbacks that actually interact
with the hardware. The device will automatically suspend after 5 seconds
of inactivity (no hardware accesses and no pending commands), and it
will be resumed on the next hardware access.

Signed-off-by: Lizhi Hou --- drivers/accel/amdxdna/Makefile | 1 + drivers/accel/amdxdna/aie2_ctx.c | 42 ++++++++--- drivers/accel/amdxdna/aie2_message.c | 28 ++++---- drivers/accel/amdxdna/aie2_pci.c | 79 ++++++++++---------- drivers/accel/amdxdna/aie2_pci.h | 3 +- drivers/accel/amdxdna/aie2_smu.c | 28 ++++++-- drivers/accel/amdxdna/amdxdna_ctx.c | 54 +++++++------- drivers/accel/amdxdna/amdxdna_mailbox.c | 13 ++-- drivers/accel/amdxdna/amdxdna_pci_drv.c | 56 ++------------- drivers/accel/amdxdna/amdxdna_pci_drv.h | 2 + drivers/accel/amdxdna/amdxdna_pm.c | 96 +++++++++++++++++++++++++ drivers/accel/amdxdna/amdxdna_pm.h | 18 +++++ 12 files changed, 262 insertions(+), 158 deletions(-) create mode 100644 drivers/accel/amdxdna/amdxdna_pm.c create mode 100644 drivers/accel/amdxdna/amdxdna_pm.h diff --git a/drivers/accel/amdxdna/Makefile b/drivers/accel/amdxdna/Makefile index 6797dac65efa..6344aaf523fa 100644 --- a/drivers/accel/amdxdna/Makefile +++ b/drivers/accel/amdxdna/Makefile @@ -14,6 +14,7 @@ amdxdna-y :=3D \ amdxdna_mailbox.o \ amdxdna_mailbox_helper.o \ amdxdna_pci_drv.o \ + amdxdna_pm.o \ amdxdna_sysfs.o \ amdxdna_ubuf.o \ npu1_regs.o \ diff --git a/drivers/accel/amdxdna/aie2_ctx.c b/drivers/accel/amdxdna/aie2_= ctx.c index e9f9b1fa5dc1..691fdb3b008f 100644 --- a/drivers/accel/amdxdna/aie2_ctx.c +++ b/drivers/accel/amdxdna/aie2_ctx.c @@ -21,6 +21,7 @@ #include "amdxdna_gem.h" #include "amdxdna_mailbox.h" #include "amdxdna_pci_drv.h" +#include "amdxdna_pm.h" =20 static bool force_cmdlist; module_param(force_cmdlist, bool, 0600); @@ -88,7 +89,7 @@ static int aie2_hwctx_restart(struct amdxdna_dev *xdna, s= truct amdxdna_hwctx *hw goto out; } =20 - ret =3D aie2_config_cu(hwctx); + ret =3D aie2_config_cu(hwctx, NULL); if (ret) { XDNA_ERR(xdna, "Config cu failed, ret %d", ret); goto out; @@ -167,14 +168,11 @@ static int aie2_hwctx_resume_cb(struct amdxdna_hwctx = *hwctx, void *arg) =20 int aie2_hwctx_resume(struct amdxdna_client *client) { - struct amdxdna_dev *xdna 
=3D client->xdna; - /* * The resume path cannot guarantee that mailbox channel can be * regenerated. If this happen, when submit message to this * mailbox channel, error will return. */ - drm_WARN_ON(&xdna->ddev, !mutex_is_locked(&xdna->dev_lock)); return amdxdna_hwctx_walk(client, NULL, aie2_hwctx_resume_cb); } =20 @@ -184,6 +182,8 @@ aie2_sched_notify(struct amdxdna_sched_job *job) struct dma_fence *fence =3D job->fence; =20 trace_xdna_job(&job->base, job->hwctx->name, "signaled fence", job->seq); + + amdxdna_pm_suspend_put(job->hwctx->client->xdna); job->hwctx->priv->completed++; dma_fence_signal(fence); =20 @@ -531,7 +531,7 @@ int aie2_hwctx_init(struct amdxdna_hwctx *hwctx) .num_rqs =3D DRM_SCHED_PRIORITY_COUNT, .credit_limit =3D HWCTX_MAX_CMDS, .timeout =3D msecs_to_jiffies(HWCTX_MAX_TIMEOUT), - .name =3D hwctx->name, + .name =3D "amdxdna_js", .dev =3D xdna->ddev.dev, }; struct drm_gpu_scheduler *sched; @@ -697,6 +697,14 @@ void aie2_hwctx_fini(struct amdxdna_hwctx *hwctx) kfree(hwctx->cus); } =20 +static int aie2_config_cu_resp_handler(void *handle, void __iomem *data, s= ize_t size) +{ + struct amdxdna_hwctx *hwctx =3D handle; + + amdxdna_pm_suspend_put(hwctx->client->xdna); + return 0; +} + static int aie2_hwctx_cu_config(struct amdxdna_hwctx *hwctx, void *buf, u3= 2 size) { struct amdxdna_hwctx_param_config_cu *config =3D buf; @@ -728,10 +736,14 @@ static int aie2_hwctx_cu_config(struct amdxdna_hwctx = *hwctx, void *buf, u32 size if (!hwctx->cus) return -ENOMEM; =20 - ret =3D aie2_config_cu(hwctx); + ret =3D amdxdna_pm_resume_get(xdna); + if (ret) + goto free_cus; + + ret =3D aie2_config_cu(hwctx, aie2_config_cu_resp_handler); if (ret) { XDNA_ERR(xdna, "Config CU to firmware failed, ret %d", ret); - goto free_cus; + goto pm_suspend_put; } =20 wmb(); /* To avoid locking in command submit when check status */ @@ -739,6 +751,8 @@ static int aie2_hwctx_cu_config(struct amdxdna_hwctx *h= wctx, void *buf, u32 size =20 return 0; =20 +pm_suspend_put: + 
amdxdna_pm_suspend_put(xdna); free_cus: kfree(hwctx->cus); hwctx->cus =3D NULL; @@ -862,11 +876,15 @@ int aie2_cmd_submit(struct amdxdna_hwctx *hwctx, stru= ct amdxdna_sched_job *job, goto free_chain; } =20 + ret =3D amdxdna_pm_resume_get(xdna); + if (ret) + goto cleanup_job; + retry: ret =3D drm_gem_lock_reservations(job->bos, job->bo_cnt, &acquire_ctx); if (ret) { XDNA_WARN(xdna, "Failed to lock BOs, ret %d", ret); - goto cleanup_job; + goto suspend_put; } =20 for (i =3D 0; i < job->bo_cnt; i++) { @@ -874,7 +892,7 @@ int aie2_cmd_submit(struct amdxdna_hwctx *hwctx, struct= amdxdna_sched_job *job, if (ret) { XDNA_WARN(xdna, "Failed to reserve fences %d", ret); drm_gem_unlock_reservations(job->bos, job->bo_cnt, &acquire_ctx); - goto cleanup_job; + goto suspend_put; } } =20 @@ -889,12 +907,12 @@ int aie2_cmd_submit(struct amdxdna_hwctx *hwctx, stru= ct amdxdna_sched_job *job, msecs_to_jiffies(HMM_RANGE_DEFAULT_TIMEOUT); } else if (time_after(jiffies, timeout)) { ret =3D -ETIME; - goto cleanup_job; + goto suspend_put; } =20 ret =3D aie2_populate_range(abo); if (ret) - goto cleanup_job; + goto suspend_put; goto retry; } } @@ -920,6 +938,8 @@ int aie2_cmd_submit(struct amdxdna_hwctx *hwctx, struct= amdxdna_sched_job *job, =20 return 0; =20 +suspend_put: + amdxdna_pm_suspend_put(xdna); cleanup_job: drm_sched_job_cleanup(&job->base); free_chain: diff --git a/drivers/accel/amdxdna/aie2_message.c b/drivers/accel/amdxdna/a= ie2_message.c index 9caad083543d..4660e8297ed8 100644 --- a/drivers/accel/amdxdna/aie2_message.c +++ b/drivers/accel/amdxdna/aie2_message.c @@ -37,7 +37,7 @@ static int aie2_send_mgmt_msg_wait(struct amdxdna_dev_hdl= *ndev, if (!ndev->mgmt_chann) return -ENODEV; =20 - drm_WARN_ON(&xdna->ddev, !mutex_is_locked(&xdna->dev_lock)); + drm_WARN_ON(&xdna->ddev, xdna->rpm_on && !mutex_is_locked(&xdna->dev_lock= )); ret =3D xdna_send_msg_wait(xdna, ndev->mgmt_chann, msg); if (ret =3D=3D -ETIME) { xdna_mailbox_stop_channel(ndev->mgmt_chann); @@ -377,15 +377,17 @@ 
int aie2_register_asyn_event_msg(struct amdxdna_dev_h= dl *ndev, dma_addr_t addr, return xdna_mailbox_send_msg(ndev->mgmt_chann, &msg, TX_TIMEOUT); } =20 -int aie2_config_cu(struct amdxdna_hwctx *hwctx) +int aie2_config_cu(struct amdxdna_hwctx *hwctx, + int (*notify_cb)(void *, void __iomem *, size_t)) { struct mailbox_channel *chann =3D hwctx->priv->mbox_chann; struct amdxdna_dev *xdna =3D hwctx->client->xdna; u32 shift =3D xdna->dev_info->dev_mem_buf_shift; - DECLARE_AIE2_MSG(config_cu, MSG_OP_CONFIG_CU); + struct config_cu_req req =3D { 0 }; + struct xdna_mailbox_msg msg; struct drm_gem_object *gobj; struct amdxdna_gem_obj *abo; - int ret, i; + int i; =20 if (!chann) return -ENODEV; @@ -423,18 +425,12 @@ int aie2_config_cu(struct amdxdna_hwctx *hwctx) } req.num_cus =3D hwctx->cus->num_cus; =20 - ret =3D xdna_send_msg_wait(xdna, chann, &msg); - if (ret =3D=3D -ETIME) - aie2_destroy_context(xdna->dev_handle, hwctx); - - if (resp.status =3D=3D AIE2_STATUS_SUCCESS) { - XDNA_DBG(xdna, "Configure %d CUs, ret %d", req.num_cus, ret); - return 0; - } - - XDNA_ERR(xdna, "Command opcode 0x%x failed, status 0x%x ret %d", - msg.opcode, resp.status, ret); - return ret; + msg.send_data =3D (u8 *)&req; + msg.send_size =3D sizeof(req); + msg.handle =3D hwctx; + msg.opcode =3D MSG_OP_CONFIG_CU; + msg.notify_cb =3D notify_cb; + return xdna_mailbox_send_msg(chann, &msg, TX_TIMEOUT); } =20 int aie2_execbuf(struct amdxdna_hwctx *hwctx, struct amdxdna_sched_job *jo= b, diff --git a/drivers/accel/amdxdna/aie2_pci.c b/drivers/accel/amdxdna/aie2_= pci.c index 6e39c769bb6d..f46a3dcd0580 100644 --- a/drivers/accel/amdxdna/aie2_pci.c +++ b/drivers/accel/amdxdna/aie2_pci.c @@ -25,6 +25,7 @@ #include "amdxdna_gem.h" #include "amdxdna_mailbox.h" #include "amdxdna_pci_drv.h" +#include "amdxdna_pm.h" =20 static int aie2_max_col =3D XRS_MAX_COL; module_param(aie2_max_col, uint, 0600); @@ -223,15 +224,6 @@ static int aie2_mgmt_fw_init(struct amdxdna_dev_hdl *n= dev) return ret; } =20 - if 
(!ndev->async_events) - return 0; - - ret =3D aie2_error_async_events_send(ndev); - if (ret) { - XDNA_ERR(ndev->xdna, "Send async events failed"); - return ret; - } - return 0; } =20 @@ -257,6 +249,8 @@ static int aie2_mgmt_fw_query(struct amdxdna_dev_hdl *n= dev) return ret; } =20 + ndev->total_col =3D min(aie2_max_col, ndev->metadata.cols); + return 0; } =20 @@ -338,6 +332,7 @@ static void aie2_hw_stop(struct amdxdna_dev *xdna) ndev->mbox =3D NULL; aie2_psp_stop(ndev->psp_hdl); aie2_smu_fini(ndev); + aie2_error_async_events_free(ndev); pci_disable_device(pdev); =20 ndev->dev_status =3D AIE2_DEV_INIT; @@ -424,6 +419,18 @@ static int aie2_hw_start(struct amdxdna_dev *xdna) goto destroy_mgmt_chann; } =20 + ret =3D aie2_mgmt_fw_query(ndev); + if (ret) { + XDNA_ERR(xdna, "failed to query fw, ret %d", ret); + goto destroy_mgmt_chann; + } + + ret =3D aie2_error_async_events_alloc(ndev); + if (ret) { + XDNA_ERR(xdna, "Allocate async events failed, ret %d", ret); + goto destroy_mgmt_chann; + } + ndev->dev_status =3D AIE2_DEV_START; =20 return 0; @@ -459,7 +466,6 @@ static int aie2_hw_resume(struct amdxdna_dev *xdna) struct amdxdna_client *client; int ret; =20 - guard(mutex)(&xdna->dev_lock); ret =3D aie2_hw_start(xdna); if (ret) { XDNA_ERR(xdna, "Start hardware failed, %d", ret); @@ -565,13 +571,6 @@ static int aie2_init(struct amdxdna_dev *xdna) goto release_fw; } =20 - ret =3D aie2_mgmt_fw_query(ndev); - if (ret) { - XDNA_ERR(xdna, "Query firmware failed, ret %d", ret); - goto stop_hw; - } - ndev->total_col =3D min(aie2_max_col, ndev->metadata.cols); - xrs_cfg.clk_list.num_levels =3D ndev->max_dpm_level + 1; for (i =3D 0; i < xrs_cfg.clk_list.num_levels; i++) xrs_cfg.clk_list.cu_clk_list[i] =3D ndev->priv->dpm_clk_tbl[i].hclk; @@ -587,30 +586,10 @@ static int aie2_init(struct amdxdna_dev *xdna) goto stop_hw; } =20 - ret =3D aie2_error_async_events_alloc(ndev); - if (ret) { - XDNA_ERR(xdna, "Allocate async events failed, ret %d", ret); - goto stop_hw; - } - - ret =3D 
aie2_error_async_events_send(ndev); - if (ret) { - XDNA_ERR(xdna, "Send async events failed, ret %d", ret); - goto async_event_free; - } - - /* Issue a command to make sure firmware handled async events */ - ret =3D aie2_query_firmware_version(ndev, &ndev->xdna->fw_ver); - if (ret) { - XDNA_ERR(xdna, "Re-query firmware version failed"); - goto async_event_free; - } - release_firmware(fw); + amdxdna_rpm_init(xdna); return 0; =20 -async_event_free: - aie2_error_async_events_free(ndev); stop_hw: aie2_hw_stop(xdna); release_fw: @@ -621,10 +600,8 @@ static int aie2_init(struct amdxdna_dev *xdna) =20 static void aie2_fini(struct amdxdna_dev *xdna) { - struct amdxdna_dev_hdl *ndev =3D xdna->dev_handle; - + amdxdna_rpm_fini(xdna); aie2_hw_stop(xdna); - aie2_error_async_events_free(ndev); } =20 static int aie2_get_aie_status(struct amdxdna_client *client, @@ -856,6 +833,10 @@ static int aie2_get_info(struct amdxdna_client *client= , struct amdxdna_drm_get_i if (!drm_dev_enter(&xdna->ddev, &idx)) return -ENODEV; =20 + ret =3D amdxdna_pm_resume_get(xdna); + if (ret) + goto dev_exit; + switch (args->param) { case DRM_AMDXDNA_QUERY_AIE_STATUS: ret =3D aie2_get_aie_status(client, args); @@ -882,8 +863,11 @@ static int aie2_get_info(struct amdxdna_client *client= , struct amdxdna_drm_get_i XDNA_ERR(xdna, "Not supported request parameter %u", args->param); ret =3D -EOPNOTSUPP; } + + amdxdna_pm_suspend_put(xdna); XDNA_DBG(xdna, "Got param %d", args->param); =20 +dev_exit: drm_dev_exit(idx); return ret; } @@ -932,6 +916,10 @@ static int aie2_get_array(struct amdxdna_client *clien= t, if (!drm_dev_enter(&xdna->ddev, &idx)) return -ENODEV; =20 + ret =3D amdxdna_pm_resume_get(xdna); + if (ret) + goto dev_exit; + switch (args->param) { case DRM_AMDXDNA_HW_CONTEXT_ALL: ret =3D aie2_query_ctx_status_array(client, args); @@ -940,8 +928,11 @@ static int aie2_get_array(struct amdxdna_client *clien= t, XDNA_ERR(xdna, "Not supported request parameter %u", args->param); ret =3D -EOPNOTSUPP; } + 
+ amdxdna_pm_suspend_put(xdna); XDNA_DBG(xdna, "Got param %d", args->param); =20 +dev_exit: drm_dev_exit(idx); return ret; } @@ -980,6 +971,10 @@ static int aie2_set_state(struct amdxdna_client *clien= t, if (!drm_dev_enter(&xdna->ddev, &idx)) return -ENODEV; =20 + ret =3D amdxdna_pm_resume_get(xdna); + if (ret) + goto dev_exit; + switch (args->param) { case DRM_AMDXDNA_SET_POWER_MODE: ret =3D aie2_set_power_mode(client, args); @@ -990,6 +985,8 @@ static int aie2_set_state(struct amdxdna_client *client, break; } =20 + amdxdna_pm_suspend_put(xdna); +dev_exit: drm_dev_exit(idx); return ret; } diff --git a/drivers/accel/amdxdna/aie2_pci.h b/drivers/accel/amdxdna/aie2_= pci.h index 91a8e948f82a..289a23ecd5f1 100644 --- a/drivers/accel/amdxdna/aie2_pci.h +++ b/drivers/accel/amdxdna/aie2_pci.h @@ -272,7 +272,8 @@ int aie2_map_host_buf(struct amdxdna_dev_hdl *ndev, u32= context_id, u64 addr, u6 int aie2_query_status(struct amdxdna_dev_hdl *ndev, char __user *buf, u32 = size, u32 *cols_filled); int aie2_register_asyn_event_msg(struct amdxdna_dev_hdl *ndev, dma_addr_t = addr, u32 size, void *handle, int (*cb)(void*, void __iomem *, size_t)); -int aie2_config_cu(struct amdxdna_hwctx *hwctx); +int aie2_config_cu(struct amdxdna_hwctx *hwctx, + int (*notify_cb)(void *, void __iomem *, size_t)); int aie2_execbuf(struct amdxdna_hwctx *hwctx, struct amdxdna_sched_job *jo= b, int (*notify_cb)(void *, void __iomem *, size_t)); int aie2_cmdlist_single_execbuf(struct amdxdna_hwctx *hwctx, diff --git a/drivers/accel/amdxdna/aie2_smu.c b/drivers/accel/amdxdna/aie2_= smu.c index d303701b0ded..7f292a615ed8 100644 --- a/drivers/accel/amdxdna/aie2_smu.c +++ b/drivers/accel/amdxdna/aie2_smu.c @@ -11,6 +11,7 @@ =20 #include "aie2_pci.h" #include "amdxdna_pci_drv.h" +#include "amdxdna_pm.h" =20 #define SMU_RESULT_OK 1 =20 @@ -59,12 +60,16 @@ int npu1_set_dpm(struct amdxdna_dev_hdl *ndev, u32 dpm_= level) u32 freq; int ret; =20 + ret =3D amdxdna_pm_resume_get(ndev->xdna); + if (ret) + return 
ret; + ret =3D aie2_smu_exec(ndev, AIE2_SMU_SET_MPNPUCLK_FREQ, ndev->priv->dpm_clk_tbl[dpm_level].npuclk, &freq); if (ret) { XDNA_ERR(ndev->xdna, "Set npu clock to %d failed, ret %d\n", ndev->priv->dpm_clk_tbl[dpm_level].npuclk, ret); - return ret; + goto suspend_put; } ndev->npuclk_freq =3D freq; =20 @@ -73,8 +78,10 @@ int npu1_set_dpm(struct amdxdna_dev_hdl *ndev, u32 dpm_l= evel) if (ret) { XDNA_ERR(ndev->xdna, "Set h clock to %d failed, ret %d\n", ndev->priv->dpm_clk_tbl[dpm_level].hclk, ret); - return ret; + goto suspend_put; } + + amdxdna_pm_suspend_put(ndev->xdna); ndev->hclk_freq =3D freq; ndev->dpm_level =3D dpm_level; =20 @@ -82,26 +89,35 @@ int npu1_set_dpm(struct amdxdna_dev_hdl *ndev, u32 dpm_= level) ndev->npuclk_freq, ndev->hclk_freq); =20 return 0; + +suspend_put: + amdxdna_pm_suspend_put(ndev->xdna); + return ret; } =20 int npu4_set_dpm(struct amdxdna_dev_hdl *ndev, u32 dpm_level) { int ret; =20 + ret =3D amdxdna_pm_resume_get(ndev->xdna); + if (ret) + return ret; + ret =3D aie2_smu_exec(ndev, AIE2_SMU_SET_HARD_DPMLEVEL, dpm_level, NULL); if (ret) { XDNA_ERR(ndev->xdna, "Set hard dpm level %d failed, ret %d ", dpm_level, ret); - return ret; + goto suspend_put; } =20 ret =3D aie2_smu_exec(ndev, AIE2_SMU_SET_SOFT_DPMLEVEL, dpm_level, NULL); if (ret) { XDNA_ERR(ndev->xdna, "Set soft dpm level %d failed, ret %d", dpm_level, ret); - return ret; + goto suspend_put; } =20 + amdxdna_pm_suspend_put(ndev->xdna); ndev->npuclk_freq =3D ndev->priv->dpm_clk_tbl[dpm_level].npuclk; ndev->hclk_freq =3D ndev->priv->dpm_clk_tbl[dpm_level].hclk; ndev->dpm_level =3D dpm_level; @@ -110,6 +126,10 @@ int npu4_set_dpm(struct amdxdna_dev_hdl *ndev, u32 dpm= _level) ndev->npuclk_freq, ndev->hclk_freq); =20 return 0; + +suspend_put: + amdxdna_pm_suspend_put(ndev->xdna); + return ret; } =20 int aie2_smu_init(struct amdxdna_dev_hdl *ndev) diff --git a/drivers/accel/amdxdna/amdxdna_ctx.c b/drivers/accel/amdxdna/am= dxdna_ctx.c index 4bfe4ef20550..c3300eff7460 100644 --- 
a/drivers/accel/amdxdna/amdxdna_ctx.c +++ b/drivers/accel/amdxdna/amdxdna_ctx.c @@ -161,14 +161,9 @@ int amdxdna_drm_create_hwctx_ioctl(struct drm_device *= dev, void *data, struct dr if (args->ext || args->ext_flags) return -EINVAL; =20 - if (!drm_dev_enter(dev, &idx)) - return -ENODEV; - hwctx =3D kzalloc(sizeof(*hwctx), GFP_KERNEL); - if (!hwctx) { - ret =3D -ENOMEM; - goto exit; - } + if (!hwctx) + return -ENOMEM; =20 if (copy_from_user(&hwctx->qos, u64_to_user_ptr(args->qos_p), sizeof(hwct= x->qos))) { XDNA_ERR(xdna, "Access QoS info failed"); @@ -181,30 +176,36 @@ int amdxdna_drm_create_hwctx_ioctl(struct drm_device = *dev, void *data, struct dr hwctx->num_tiles =3D args->num_tiles; hwctx->mem_size =3D args->mem_size; hwctx->max_opc =3D args->max_opc; - ret =3D xa_alloc_cyclic(&client->hwctx_xa, &hwctx->id, hwctx, - XA_LIMIT(AMDXDNA_INVALID_CTX_HANDLE + 1, MAX_HWCTX_ID), - &client->next_hwctxid, GFP_KERNEL); - if (ret < 0) { - XDNA_ERR(xdna, "Allocate hwctx ID failed, ret %d", ret); + + guard(mutex)(&xdna->dev_lock); + + if (!drm_dev_enter(dev, &idx)) { + ret =3D -ENODEV; goto free_hwctx; } =20 - hwctx->name =3D kasprintf(GFP_KERNEL, "hwctx.%d.%d", client->pid, hwctx->= id); + ret =3D xdna->dev_info->ops->hwctx_init(hwctx); + if (ret) { + XDNA_ERR(xdna, "Init hwctx failed, ret %d", ret); + goto dev_exit; + } + + hwctx->name =3D kasprintf(GFP_KERNEL, "hwctx.%d.%d", client->pid, hwctx->= fw_ctx_id); if (!hwctx->name) { ret =3D -ENOMEM; - goto rm_id; + goto fini_hwctx; } =20 - mutex_lock(&xdna->dev_lock); - ret =3D xdna->dev_info->ops->hwctx_init(hwctx); - if (ret) { - mutex_unlock(&xdna->dev_lock); - XDNA_ERR(xdna, "Init hwctx failed, ret %d", ret); + ret =3D xa_alloc_cyclic(&client->hwctx_xa, &hwctx->id, hwctx, + XA_LIMIT(AMDXDNA_INVALID_CTX_HANDLE + 1, MAX_HWCTX_ID), + &client->next_hwctxid, GFP_KERNEL); + if (ret < 0) { + XDNA_ERR(xdna, "Allocate hwctx ID failed, ret %d", ret); goto free_name; } + args->handle =3D hwctx->id; args->syncobj_handle =3D 
hwctx->syncobj_hdl; - mutex_unlock(&xdna->dev_lock); =20 atomic64_set(&hwctx->job_submit_cnt, 0); atomic64_set(&hwctx->job_free_cnt, 0); @@ -214,12 +215,12 @@ int amdxdna_drm_create_hwctx_ioctl(struct drm_device = *dev, void *data, struct dr =20 free_name: kfree(hwctx->name); -rm_id: - xa_erase(&client->hwctx_xa, hwctx->id); +fini_hwctx: + xdna->dev_info->ops->hwctx_fini(hwctx); +dev_exit: + drm_dev_exit(idx); free_hwctx: kfree(hwctx); -exit: - drm_dev_exit(idx); return ret; } =20 @@ -431,11 +432,6 @@ int amdxdna_cmd_submit(struct amdxdna_client *client, goto unlock_srcu; } =20 - if (hwctx->status !=3D HWCTX_STAT_READY) { - XDNA_ERR(xdna, "HW Context is not ready"); - ret =3D -EINVAL; - goto unlock_srcu; - } =20 job->hwctx =3D hwctx; job->mm =3D current->mm; diff --git a/drivers/accel/amdxdna/amdxdna_mailbox.c b/drivers/accel/amdxdn= a/amdxdna_mailbox.c index da1ac89bb78f..24258dcc18eb 100644 --- a/drivers/accel/amdxdna/amdxdna_mailbox.c +++ b/drivers/accel/amdxdna/amdxdna_mailbox.c @@ -194,7 +194,8 @@ static void mailbox_release_msg(struct mailbox_channel = *mb_chann, { MB_DBG(mb_chann, "msg_id 0x%x msg opcode 0x%x", mb_msg->pkg.header.id, mb_msg->pkg.header.opcode); - mb_msg->notify_cb(mb_msg->handle, NULL, 0); + if (mb_msg->notify_cb) + mb_msg->notify_cb(mb_msg->handle, NULL, 0); kfree(mb_msg); } =20 @@ -248,7 +249,7 @@ mailbox_get_resp(struct mailbox_channel *mb_chann, stru= ct xdna_msg_header *heade { struct mailbox_msg *mb_msg; int msg_id; - int ret; + int ret =3D 0; =20 msg_id =3D header->id; if (!mailbox_validate_msgid(msg_id)) { @@ -265,9 +266,11 @@ mailbox_get_resp(struct mailbox_channel *mb_chann, str= uct xdna_msg_header *heade =20 MB_DBG(mb_chann, "opcode 0x%x size %d id 0x%x", header->opcode, header->total_size, header->id); - ret =3D mb_msg->notify_cb(mb_msg->handle, data, header->total_size); - if (unlikely(ret)) - MB_ERR(mb_chann, "Message callback ret %d", ret); + if (mb_msg->notify_cb) { + ret =3D mb_msg->notify_cb(mb_msg->handle, data, 
header->total_size);
+		if (unlikely(ret))
+			MB_ERR(mb_chann, "Message callback ret %d", ret);
+	}
 
 	kfree(mb_msg);
 	return ret;
diff --git a/drivers/accel/amdxdna/amdxdna_pci_drv.c b/drivers/accel/amdxdna/amdxdna_pci_drv.c
index 569cd703729d..aa04452310e5 100644
--- a/drivers/accel/amdxdna/amdxdna_pci_drv.c
+++ b/drivers/accel/amdxdna/amdxdna_pci_drv.c
@@ -13,13 +13,11 @@
 #include
 #include
 #include
-#include
 
 #include "amdxdna_ctx.h"
 #include "amdxdna_gem.h"
 #include "amdxdna_pci_drv.h"
-
-#define AMDXDNA_AUTOSUSPEND_DELAY 5000 /* milliseconds */
+#include "amdxdna_pm.h"
 
 MODULE_FIRMWARE("amdnpu/1502_00/npu.sbin");
 MODULE_FIRMWARE("amdnpu/17f0_10/npu.sbin");
@@ -61,17 +59,9 @@ static int amdxdna_drm_open(struct drm_device *ddev, struct drm_file *filp)
 	struct amdxdna_client *client;
 	int ret;
 
-	ret = pm_runtime_resume_and_get(ddev->dev);
-	if (ret) {
-		XDNA_ERR(xdna, "Failed to get rpm, ret %d", ret);
-		return ret;
-	}
-
 	client = kzalloc(sizeof(*client), GFP_KERNEL);
-	if (!client) {
-		ret = -ENOMEM;
-		goto put_rpm;
-	}
+	if (!client)
+		return -ENOMEM;
 
 	client->pid = pid_nr(rcu_access_pointer(filp->pid));
 	client->xdna = xdna;
@@ -106,9 +96,6 @@ static int amdxdna_drm_open(struct drm_device *ddev, struct drm_file *filp)
 	iommu_sva_unbind_device(client->sva);
 failed:
 	kfree(client);
-put_rpm:
-	pm_runtime_mark_last_busy(ddev->dev);
-	pm_runtime_put_autosuspend(ddev->dev);
 
 	return ret;
 }
@@ -130,8 +117,6 @@ static void amdxdna_drm_close(struct drm_device *ddev, struct drm_file *filp)
 
 	XDNA_DBG(xdna, "pid %d closed", client->pid);
 	kfree(client);
-	pm_runtime_mark_last_busy(ddev->dev);
-	pm_runtime_put_autosuspend(ddev->dev);
 }
 
 static int amdxdna_flush(struct file *f, fl_owner_t id)
@@ -310,19 +295,12 @@ static int amdxdna_probe(struct pci_dev *pdev, const struct pci_device_id *id)
 		goto failed_dev_fini;
 	}
 
-	pm_runtime_set_autosuspend_delay(dev, AMDXDNA_AUTOSUSPEND_DELAY);
-	pm_runtime_use_autosuspend(dev);
-	pm_runtime_allow(dev);
-
 	ret = drm_dev_register(&xdna->ddev, 0);
 	if (ret) {
 		XDNA_ERR(xdna, "DRM register failed, ret %d", ret);
-		pm_runtime_forbid(dev);
 		goto failed_sysfs_fini;
 	}
 
-	pm_runtime_mark_last_busy(dev);
-	pm_runtime_put_autosuspend(dev);
 	return 0;
 
 failed_sysfs_fini:
@@ -339,14 +317,10 @@ static int amdxdna_probe(struct pci_dev *pdev, const struct pci_device_id *id)
 static void amdxdna_remove(struct pci_dev *pdev)
 {
 	struct amdxdna_dev *xdna = pci_get_drvdata(pdev);
-	struct device *dev = &pdev->dev;
 	struct amdxdna_client *client;
 
 	destroy_workqueue(xdna->notifier_wq);
 
-	pm_runtime_get_noresume(dev);
-	pm_runtime_forbid(dev);
-
 	drm_dev_unplug(&xdna->ddev);
 	amdxdna_sysfs_fini(xdna);
 
@@ -365,29 +339,9 @@ static void amdxdna_remove(struct pci_dev *pdev)
 	mutex_unlock(&xdna->dev_lock);
 }
 
-static int amdxdna_pmops_suspend(struct device *dev)
-{
-	struct amdxdna_dev *xdna = pci_get_drvdata(to_pci_dev(dev));
-
-	if (!xdna->dev_info->ops->suspend)
-		return -EOPNOTSUPP;
-
-	return xdna->dev_info->ops->suspend(xdna);
-}
-
-static int amdxdna_pmops_resume(struct device *dev)
-{
-	struct amdxdna_dev *xdna = pci_get_drvdata(to_pci_dev(dev));
-
-	if (!xdna->dev_info->ops->resume)
-		return -EOPNOTSUPP;
-
-	return xdna->dev_info->ops->resume(xdna);
-}
-
 static const struct dev_pm_ops amdxdna_pm_ops = {
-	SYSTEM_SLEEP_PM_OPS(amdxdna_pmops_suspend, amdxdna_pmops_resume)
-	RUNTIME_PM_OPS(amdxdna_pmops_suspend, amdxdna_pmops_resume, NULL)
+	SYSTEM_SLEEP_PM_OPS(amdxdna_pm_suspend, amdxdna_pm_resume)
+	RUNTIME_PM_OPS(amdxdna_pm_suspend, amdxdna_pm_resume, NULL)
 };
 
 static struct pci_driver amdxdna_pci_driver = {
diff --git a/drivers/accel/amdxdna/amdxdna_pci_drv.h b/drivers/accel/amdxdna/amdxdna_pci_drv.h
index 72d6696d49da..626beebf730e 100644
--- a/drivers/accel/amdxdna/amdxdna_pci_drv.h
+++ b/drivers/accel/amdxdna/amdxdna_pci_drv.h
@@ -6,6 +6,7 @@
 #ifndef _AMDXDNA_PCI_DRV_H_
 #define _AMDXDNA_PCI_DRV_H_
 
+#include
 #include
 #include
 
@@ -99,6 +100,7 @@ struct amdxdna_dev {
 	struct amdxdna_fw_ver fw_ver;
 	struct rw_semaphore notifier_lock; /* for mmu notifier*/
 	struct workqueue_struct *notifier_wq;
+	bool rpm_on;
 };
 
 /*
diff --git a/drivers/accel/amdxdna/amdxdna_pm.c b/drivers/accel/amdxdna/amdxdna_pm.c
new file mode 100644
index 000000000000..fad14f60b99b
--- /dev/null
+++ b/drivers/accel/amdxdna/amdxdna_pm.c
@@ -0,0 +1,96 @@
+// SPDX-License-Identifier: GPL-2.0
+/*
+ * Copyright (C) 2025, Advanced Micro Devices, Inc.
+ */
+
+#include
+#include
+#include
+
+#include "amdxdna_pm.h"
+
+#define AMDXDNA_AUTOSUSPEND_DELAY 5000 /* milliseconds */
+
+int amdxdna_pm_suspend(struct device *dev)
+{
+	struct amdxdna_dev *xdna = to_xdna_dev(dev_get_drvdata(dev));
+	int ret = -EOPNOTSUPP;
+	bool rpm;
+
+	if (xdna->dev_info->ops->suspend) {
+		rpm = xdna->rpm_on;
+		xdna->rpm_on = false;
+		ret = xdna->dev_info->ops->suspend(xdna);
+		xdna->rpm_on = rpm;
+	}
+
+	XDNA_DBG(xdna, "Suspend done ret %d", ret);
+	return ret;
+}
+
+int amdxdna_pm_resume(struct device *dev)
+{
+	struct amdxdna_dev *xdna = to_xdna_dev(dev_get_drvdata(dev));
+	int ret = -EOPNOTSUPP;
+	bool rpm;
+
+	if (xdna->dev_info->ops->resume) {
+		rpm = xdna->rpm_on;
+		xdna->rpm_on = false;
+		ret = xdna->dev_info->ops->resume(xdna);
+		xdna->rpm_on = rpm;
+	}
+
+	XDNA_DBG(xdna, "Resume done ret %d", ret);
+	return ret;
+}
+
+int amdxdna_pm_resume_get(struct amdxdna_dev *xdna)
+{
+	struct device *dev = xdna->ddev.dev;
+	int ret;
+
+	if (!xdna->rpm_on)
+		return 0;
+
+	ret = pm_runtime_resume_and_get(dev);
+	if (ret) {
+		XDNA_ERR(xdna, "Resume failed: %d", ret);
+		pm_runtime_set_suspended(dev);
+	}
+
+	return ret;
+}
+
+void amdxdna_pm_suspend_put(struct amdxdna_dev *xdna)
+{
+	struct device *dev = xdna->ddev.dev;
+
+	if (!xdna->rpm_on)
+		return;
+
+	pm_runtime_mark_last_busy(dev);
+	pm_runtime_put_autosuspend(dev);
+}
+
+void amdxdna_rpm_init(struct amdxdna_dev *xdna)
+{
+	struct device *dev = xdna->ddev.dev;
+
+	pm_runtime_set_active(dev);
+	pm_runtime_set_autosuspend_delay(dev, AMDXDNA_AUTOSUSPEND_DELAY);
+	pm_runtime_use_autosuspend(dev);
+	pm_runtime_allow(dev);
+	pm_runtime_mark_last_busy(dev);
+	pm_runtime_put_autosuspend(dev);
+	xdna->rpm_on = true;
+}
+
+void amdxdna_rpm_fini(struct amdxdna_dev *xdna)
+{
+	struct device *dev = xdna->ddev.dev;
+
+	xdna->rpm_on = false;
+	pm_runtime_get_noresume(dev);
+	pm_runtime_forbid(dev);
+}
diff --git a/drivers/accel/amdxdna/amdxdna_pm.h b/drivers/accel/amdxdna/amdxdna_pm.h
new file mode 100644
index 000000000000..11c295b6d64a
--- /dev/null
+++ b/drivers/accel/amdxdna/amdxdna_pm.h
@@ -0,0 +1,18 @@
+/* SPDX-License-Identifier: GPL-2.0 */
+/*
+ * Copyright (C) 2025, Advanced Micro Devices, Inc.
+ */
+
+#ifndef _AMDXDNA_PM_H_
+#define _AMDXDNA_PM_H_
+
+#include "amdxdna_pci_drv.h"
+
+int amdxdna_pm_suspend(struct device *dev);
+int amdxdna_pm_resume(struct device *dev);
+int amdxdna_pm_resume_get(struct amdxdna_dev *xdna);
+void amdxdna_pm_suspend_put(struct amdxdna_dev *xdna);
+void amdxdna_rpm_init(struct amdxdna_dev *xdna);
+void amdxdna_rpm_fini(struct amdxdna_dev *xdna);
+
+#endif /* _AMDXDNA_PM_H_ */
-- 
2.34.1
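
Note on the rpm_on guard: the new amdxdna_pm_suspend() and amdxdna_pm_resume() clear xdna->rpm_on around the device op and restore it afterwards, and amdxdna_pm_resume_get() / amdxdna_pm_suspend_put() return early while it is false. One plausible reading (an assumption, not stated in the patch) is that this keeps code running inside the suspend/resume callbacks from re-entering runtime PM. The userspace sketch below models only that control flow. Every name in it (struct model_dev, the model_* functions, the counters) is hypothetical and stands in for the kernel objects; it makes no pm_runtime_*() calls.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for struct amdxdna_dev: only what the model needs. */
struct model_dev {
	bool rpm_on;     /* mirrors xdna->rpm_on from the patch */
	int usage_count; /* stand-in for the runtime PM usage counter */
	int rpm_calls;   /* times the (modelled) runtime PM core was entered */
};

/* Models amdxdna_pm_resume_get(): a no-op while rpm_on is false. */
static int model_resume_get(struct model_dev *d)
{
	if (!d->rpm_on)
		return 0;
	d->usage_count++;
	d->rpm_calls++;
	return 0;
}

/* Models amdxdna_pm_suspend_put(): likewise skipped while rpm_on is false. */
static void model_suspend_put(struct model_dev *d)
{
	if (!d->rpm_on)
		return;
	d->usage_count--;
}

/*
 * Hypothetical device suspend op that reuses a helper which brackets its
 * work with resume_get/suspend_put (an assumption about how such ops
 * might be written, not taken from the patch).
 */
static int model_hw_suspend(struct model_dev *d)
{
	int ret = model_resume_get(d);

	if (ret)
		return ret;
	/* ... device-specific work would happen here ... */
	model_suspend_put(d);
	return 0;
}

/* Models amdxdna_pm_suspend(): save rpm_on, clear it, call op, restore. */
static int model_pm_suspend(struct model_dev *d)
{
	bool rpm = d->rpm_on;
	int ret;

	d->rpm_on = false;
	ret = model_hw_suspend(d);
	d->rpm_on = rpm;
	return ret;
}
```

In this model, calling model_pm_suspend() on a device with rpm_on set leaves rpm_calls at zero and rpm_on restored to true, while a model_resume_get() issued outside the callback still takes a reference as usual.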