From nobody Sat Feb 7 23:22:47 2026 Received: from mail-pf1-f170.google.com (mail-pf1-f170.google.com [209.85.210.170]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id BB9C21411DE for ; Wed, 29 Oct 2025 02:31:58 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.210.170 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705120; cv=none; b=tLOh0Z8zX0wJAJjg1Zif6TdSD6lB+N1se+qF44LIHH6XNTvkeKFTU7w23YebuUg2W+mxf38LbWy4OWKgHvJOmR3N6/H27iVXISPuljNqw5Itelo5ut94gnj46olEw3XrDcCgbh/ePsdld6McCOSRaCLwqSUchvMS+1LM/mkiSWI= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705120; c=relaxed/simple; bh=xAcUDyR46r3eyTpxz6DzBXXtrvoA66g03wdR/Ic7OY8=; h=From:To:Cc:Subject:Date:Message-Id:In-Reply-To:References: MIME-Version:Content-Type; b=sd/waB1v0er2DK/ARiheS+P6Lzjs6Dhx6K3iBFBHySmE+t8xsuvZkK7LfVnRxzW702uNPWQ8Mdgnjm7hXulMabtt7adOLW4g71tjicKxY/SY9c1UCDxbm69YfJickuWOZ3LMS0JY7VeGVJryy6G5DZ+1gMIZfl5B61NVvSJS6ug= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=HTmDkIZb; arc=none smtp.client-ip=209.85.210.170 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="HTmDkIZb" Received: by mail-pf1-f170.google.com with SMTP id d2e1a72fcca58-78af3fe5b17so5201984b3a.2 for ; Tue, 28 Oct 2025 19:31:58 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1761705118; x=1762309918; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=G8IHS5hbKcNzc4xX1pj2JUqHqgY0jEt1+7hP5OgmCMs=; b=HTmDkIZbIqs6VxB/n4OkC7hqT/7I5LA2JJoyjDlOyIxGNZgVz8gDqSyB5HKECv2bQl 4Q2HJCIjBsJuxJ8i7Od6Y/Vdf4ChV4pBj8eYRgF/sO2QXGuQIHPanJuWOh1IPz9hGJTs wtafVjHVex22gUOLcty43+4Y7Xo2Rv7o4iKvLQAUscO6AFc8wBeVUDpCmsUakw6M2pN8 n65h3ndc/5YPfvFZIcCKT1aU9COsjggoxXLD4048KVCtt/z/oaz/dwFC5guosF1kqeAu /iUqT2HLV10ZGZox6LRgWBWE/+SW9rsUgfxsbNurdtR8iOngVZovSWAgIfp1hBQTlhs9 Ymfg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1761705118; x=1762309918; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=G8IHS5hbKcNzc4xX1pj2JUqHqgY0jEt1+7hP5OgmCMs=; b=UEuUbGy4NSgbEbEwvqaEeOuKtOXynsk/bZ1fGccBaQuCxcvycMzUyZB+csepJ/rQdZ 4oJCz3YclEIoqf5EYyb5ppwdrBArz2qIGnZRkyBT51TM7Dvxm/fZKMhyifmRzHsvVv8J eLa2IbOkLWRIOGJ2dArc0yu/upUtP3rveYa/R8GqyZN3hnpzwtszbLnGIhtPWNsLGBla XT+smlZDGRKctkiNwgSiL02nTlck598aKKibFVNMLpfdohsWrc932SxtJLZ1LjIJ/otD c46zfJMpu2bc5dB134+0iHJ48HbnZaAwyPwZ5SHp5zHBtv1kKGkylTeEoIXVvM8E63NH IhpQ== X-Forwarded-Encrypted: i=1; AJvYcCX4MzpTBTf9YFNDY8TXnJttkQUfF6FsBvez8k362M1bJpPh3+9sEY9mhOkHsbs4DiiHqMv9c7S6N8dryUE=@vger.kernel.org X-Gm-Message-State: AOJu0Yw5rSgEnZn3UJCGV08KI7rUkZssCsQKdnkVqViQN7p0pBfbneAp sLwAq3pI5PZaTfTFnoZFq/2wWXfNYKv0kNejviwVnOEvvRmhZsF/ghy5 X-Gm-Gg: ASbGncuHvrVLuOf1BnHlwfdOiwTj8IfeeVFaN4PIgZuDtIgvkpojALPX4B2XALVg5hy V4Bw7xgRw1c72W09NhgerziHlfXPp9u2D8W/ZvfFG4LJvtCTsY9XUtx2VYLrHOwlE4qUtq5qUeh DuiG8G4OjsbIUGmHaxo5Rk0yRpa1PJexa0Utws/VNuXizlx4Hkn9R5cutDuITB+J/QlwIESz6Vh mEIVNoyHXbXGBESUYS5FfCEIyCqUFjAyta3Yeni7Vt/szNmshk5uJTb2S8p5k+7JbKJIpUsEMJW UUdme+v/imgrjcn7x7keIbTDVXoQFmUQrwhCL00W2q6KlIxf2/hxm9a2QhzU6WwRGoFo091SVQg tiNxEic5ZConL0Jl4IIxbInyr7Lx1oauWXjLPz0ltST5VfQRQ9Dz/S5eBXfcL4QuGpud4TxTvvS souTIba/0NnXN03Fn2IXM4IQHTvdoi+TQd/jiesk9c0QWosjNLzftLjx02og== X-Google-Smtp-Source: AGHT+IG+neopqqpPz8pE4OPY5Yo0pVr1dn6PvUKKBu7rSRz5cLkoZ3c+NU5U1HLhCN7SM0Z/zmD0cg== X-Received: by 2002:a05:6a20:3ca7:b0:344:97a7:8c60 with SMTP id adf61e73a8af0-34651b39327mr1494729637.17.1761705117917; Tue, 28 Oct 2025 19:31:57 -0700 (PDT) Received: from localhost.localdomain ([47.72.128.212]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-33fed81a4afsm13649518a91.19.2025.10.28.19.31.50 (version=TLS1_3 cipher=TLS_CHACHA20_POLY1305_SHA256 bits=256/256); Tue, 28 Oct 2025 19:31:57 -0700 (PDT) From: Barry Song <21cnbao@gmail.com> To: Catalin Marinas , Will Deacon , Marek Szyprowski , Robin Murphy Cc: Barry Song , Ada Couprie Diaz , Ard Biesheuvel , Marc Zyngier , Anshuman Khandual , Ryan Roberts , Suren Baghdasaryan , Tangquan Zheng , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, iommu@lists.linux.dev Subject: [RFC PATCH 1/5] arm64: Provide dcache_by_myline_op_nosync helper Date: Wed, 29 Oct 2025 10:31:11 +0800 Message-Id: <20251029023115.22809-2-21cnbao@gmail.com> X-Mailer: git-send-email 2.39.3 (Apple Git-146) In-Reply-To: <20251029023115.22809-1-21cnbao@gmail.com> References: <20251029023115.22809-1-21cnbao@gmail.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable From: Barry Song dcache_by_myline_op ensures completion of the data cache operations for a region, while dcache_by_myline_op_nosync only issues them without waiting. This enables deferred synchronization so completion for multiple regions can be handled together later. Cc: Catalin Marinas Cc: Will Deacon Cc: Marek Szyprowski Cc: Robin Murphy Cc: Ada Couprie Diaz Cc: Ard Biesheuvel Cc: Marc Zyngier Cc: Anshuman Khandual Cc: Ryan Roberts Cc: Suren Baghdasaryan Cc: Tangquan Zheng Cc: linux-arm-kernel@lists.infradead.org Cc: linux-kernel@vger.kernel.org Cc: iommu@lists.linux.dev Signed-off-by: Barry Song --- arch/arm64/include/asm/assembler.h | 79 ++++++++++++++++++++++-------- 1 file changed, 59 insertions(+), 20 deletions(-) diff --git a/arch/arm64/include/asm/assembler.h b/arch/arm64/include/asm/as= sembler.h index 23be85d93348..115196ce4800 100644 --- a/arch/arm64/include/asm/assembler.h +++ b/arch/arm64/include/asm/assembler.h @@ -366,22 +366,7 @@ alternative_else alternative_endif .endm =20 -/* - * Macro to perform a data cache maintenance for the interval - * [start, end) with dcache line size explicitly provided. - * - * op: operation passed to dc instruction - * domain: domain used in dsb instruciton - * start: starting virtual address of the region - * end: end virtual address of the region - * linesz: dcache line size - * fixup: optional label to branch to on user fault - * Corrupts: start, end, tmp - */ - .macro dcache_by_myline_op op, domain, start, end, linesz, tmp, fixup - sub \tmp, \linesz, #1 - bic \start, \start, \tmp -.Ldcache_op\@: + .macro __dcache_op_line op, start .ifc \op, cvau __dcache_op_workaround_clean_cache \op, \start .else @@ -399,14 +384,54 @@ alternative_endif .endif .endif .endif - add \start, \start, \linesz - cmp \start, \end - b.lo .Ldcache_op\@ - dsb \domain + .endm + +/* + * Macro to perform a data cache maintenance for the interval + * [start, end) with dcache line size explicitly provided. + * + * op: operation passed to dc instruction + * domain: domain used in dsb instruciton + * start: starting virtual address of the region + * end: end virtual address of the region + * linesz: dcache line size + * fixup: optional label to branch to on user fault + * Corrupts: start, end, tmp + */ + .macro dcache_by_myline_op op, domain, start, end, linesz, tmp, fixup + sub \tmp, \linesz, #1 + bic \start, \start, \tmp +.Ldcache_op\@: + __dcache_op_line \op, \start + add \start, \start, \linesz + cmp \start, \end + b.lo .Ldcache_op\@ =20 + dsb \domain _cond_uaccess_extable .Ldcache_op\@, \fixup .endm =20 +/* + * Macro to perform a data cache maintenance for the interval + * [start, end) with dcache line size explicitly provided. + * It won't wait for the completion of the dc operation. + * + * op: operation passed to dc instruction + * start: starting virtual address of the region + * end: end virtual address of the region + * linesz: dcache line size + * Corrupts: start, end, tmp + */ + .macro dcache_by_myline_op_nosync op, start, end, linesz, tmp + sub \tmp, \linesz, #1 + bic \start, \start, \tmp +.Ldcache_op\@: + __dcache_op_line \op, \start + add \start, \start, \linesz + cmp \start, \end + b.lo .Ldcache_op\@ + .endm + /* * Macro to perform a data cache maintenance for the interval * [start, end) @@ -423,6 +448,20 @@ alternative_endif dcache_by_myline_op \op, \domain, \start, \end, \tmp1, \tmp2, \fixup .endm =20 +/* + * Macro to perform a data cache maintenance for the interval + * [start, end). It won=E2=80=99t wait for the dc operation to complete. + * + * op: operation passed to dc instruction + * start: starting virtual address of the region + * end: end virtual address of the region + * Corrupts: start, end, tmp1, tmp2 + */ + .macro dcache_by_line_op_nosync op, start, end, tmp1, tmp2 + dcache_line_size \tmp1, \tmp2 + dcache_by_myline_op_nosync \op, \start, \end, \tmp1, \tmp2 + .endm + /* * Macro to perform an instruction cache maintenance for the interval * [start, end) --=20 2.39.3 (Apple Git-146) From nobody Sat Feb 7 23:22:47 2026 Received: from mail-pj1-f46.google.com (mail-pj1-f46.google.com [209.85.216.46]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 753072E7F29 for ; Wed, 29 Oct 2025 02:32:07 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.46 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705129; cv=none; b=mOGkIlmf/rACfVm2sY+VaVEeujiFaCbNhbwzAQ0lqCntVgOEr8XpkEhLD+0CE/pVMFOLpurdqzG6789JfG/E3EPUjlpvVBi4CfI8do+fRi29bXD9pLaZtxj8X7S3PduqhaVK7sNo3P+8+SwlDv0pgus26+RVQzMej/G3/InDi7c= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705129; c=relaxed/simple; bh=zYNJmlon8r/JHG3Xpn9ntElzz/UgGYjwUNixebL2tkg=; h=From:To:Cc:Subject:Date:Message-Id:In-Reply-To:References: MIME-Version; b=AcyXCoNSBhui26SoEP24BUuRLgF9kG+2ghotrZzsWE6pEy9vxHzkyYgdi6oOYdYnErChqbqALukXa5YVxuwkw9TYavZClRrmZTx32HUe/8lsetI8SpLEeu2EwwN0dm5plgK0J+U53WxIFXAPDvKsf7vPT/AfMGjNgoidV6S5ct0= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=EFy2gnHy; arc=none smtp.client-ip=209.85.216.46 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="EFy2gnHy" Received: by mail-pj1-f46.google.com with SMTP id 98e67ed59e1d1-33255011eafso6281668a91.1 for ; Tue, 28 Oct 2025 19:32:07 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1761705127; x=1762309927; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=i6qoeYX7P7dZob4Bvz9VKgpFSbzrkhmCImjfBPhgn1s=; b=EFy2gnHyZ5BWurS6tG5Li664Lvj8+ODJHoCqwRczBSG6eyikl4oW6KrtnQdXvQPCzJ P+pu2ZgXrPOmm8jVfGZsI/S3nu3qWelGIqyFdAwT1k2Xk14NmtBqrBCIIasJxWyYcVFf JJdM2r8avql/ynSr7h4qohn+jBYiZi+2qQoRq965wDbL83u9B3OcAppzzK3bh3AH2Lhg BJrAmxJU5KKU351Vz5wkbYyP4KQRt/EtYnPAN0NSRqsS/zu6cKUfGAMo0Yg9oFQAhFRk 2OOhQtxFGtWfOG37roGJA2M5aRPPnyCOtnCruesp7IDFySXvWl+o9/Jm7x0nwe+bDBJU slhA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1761705127; x=1762309927; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=i6qoeYX7P7dZob4Bvz9VKgpFSbzrkhmCImjfBPhgn1s=; b=DA9zXPWykQpyC9jiQI+UCKfgDlM+uxjW73lfXIgbDfix9zT9Yj4g4Qo6/WVFI0F+1H P15jOHqY6/K+NzAbpVy7e1gEXYN3LItFaglIKXl5YZ9sV3OWErcgRJ3aM3oyJ29gJa6z jIvSbBnJ+fUiP2wuYCqL6niP79ctCzdOd2r8sNLuiP+Wd5tczt61eIItybrkxU5HZW5C Uhu8G12P5jQPGwEUYUXdrY30wzn5Y68CxJFupQDl8NJjNoXUvkwAZ4+ahSF2a0hg9ci1 9PIrIWWiemrq7x9s4LpkkCGEh7c/0qrFB7PcSZw1+N3zZ+OqoUCwMEhAFVqeDzXhFKCW nbNg== X-Forwarded-Encrypted: i=1; AJvYcCXiTyLGSMmTkIC0/l70U2QqZ2wZ1G8WxxShSgmgkuSu3V5P/rACi5y+DsRH2X8JWaL8Bj8V0VTnV/TLfEo=@vger.kernel.org X-Gm-Message-State: AOJu0YzTNxAAPaxzRWBd93xGwDXkvD9HOOJmJIaGYJz/4KkxhkV8m0tl EYY5/kxc6aj1BSiAlTNreZySZDHuCbPGNo2e+ataxM/LvK0U3gdYi5/E X-Gm-Gg: ASbGncsJjs+awCBAhQya2f3BJEZhjqRXYR7AEHTDbOEU6cErOcWpV/3cfBU8LNQhuz+ SeF92MntZlFEeD224ruJX+Z2aLxDr4hj1KgYA1jYuOy2PNOYqvjIjkO35n+GoTgIVRNm3Cifxo/ l/klr/LLp6Zt6PLP83z+TqLBWJQLHf0OKr7cPLvyTxGmWzjVCkhQLaROEUySc6NGo8XzJo9hMtY CF/Kzr1GoT+06kqkG/OiTuryZCtGIF2NAWaPBjV28ELHlyr9Qg47/PSigSlsheZZhdbQmCI/IBh 4HsntEtIIgXVUPMseUlhjpx6Cmmm4R9cmkdlnOMPd6VsQIqtT3b93rYpEcrBi+u/jU7xLwfcJaj av5k5Au9hIdO80anCuSOZh2uCJ+AyoOezDszxGU2GWF4lZrWYGGOHMVMOsI5TpKLCwp1CBOiD6d ABwz1x5keotV7msBtZhKlphN0nnCL3U2bTnf8/RBAryDSDWBN1z6TUz0X+wg== X-Google-Smtp-Source: AGHT+IELIohm+a85sePD0UFgSVnJbmnjewzzZy+ustTJJxjo6UFTuJaxy6bk5m8Cbv3z6kHyDqZwZg== X-Received: by 2002:a17:90b:5285:b0:32d:e75a:5009 with SMTP id 98e67ed59e1d1-3403a15d171mr1418626a91.1.1761705126491; Tue, 28 Oct 2025 19:32:06 -0700 (PDT) Received: from localhost.localdomain ([47.72.128.212]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-33fed81a4afsm13649518a91.19.2025.10.28.19.32.00 (version=TLS1_3 cipher=TLS_CHACHA20_POLY1305_SHA256 bits=256/256); Tue, 28 Oct 2025 19:32:05 -0700 (PDT) From: Barry Song <21cnbao@gmail.com> To: Catalin Marinas , Will Deacon , Marek Szyprowski , Robin Murphy Cc: Barry Song , Ada Couprie Diaz , Ard Biesheuvel , Marc Zyngier , Anshuman Khandual , Ryan Roberts , Suren Baghdasaryan , Tangquan Zheng , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, iommu@lists.linux.dev Subject: [RFC PATCH 2/5] arm64: Provide dcache_clean_poc_nosync helper Date: Wed, 29 Oct 2025 10:31:12 +0800 Message-Id: <20251029023115.22809-3-21cnbao@gmail.com> X-Mailer: git-send-email 2.39.3 (Apple Git-146) In-Reply-To: <20251029023115.22809-1-21cnbao@gmail.com> References: <20251029023115.22809-1-21cnbao@gmail.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" From: Barry Song dcache_clean_poc_nosync does not wait for the data cache clean to complete. Later, we wait for completion of all scatter-gather entries together. Cc: Catalin Marinas Cc: Will Deacon Cc: Marek Szyprowski Cc: Robin Murphy Cc: Ada Couprie Diaz Cc: Ard Biesheuvel Cc: Marc Zyngier Cc: Anshuman Khandual Cc: Ryan Roberts Cc: Suren Baghdasaryan Cc: Tangquan Zheng Cc: linux-arm-kernel@lists.infradead.org Cc: linux-kernel@vger.kernel.org Cc: iommu@lists.linux.dev Signed-off-by: Barry Song --- arch/arm64/include/asm/cacheflush.h | 1 + arch/arm64/mm/cache.S | 15 +++++++++++++++ 2 files changed, 16 insertions(+) diff --git a/arch/arm64/include/asm/cacheflush.h b/arch/arm64/include/asm/c= acheflush.h index 28ab96e808ef..9b6d0a62cf3d 100644 --- a/arch/arm64/include/asm/cacheflush.h +++ b/arch/arm64/include/asm/cacheflush.h @@ -74,6 +74,7 @@ extern void icache_inval_pou(unsigned long start, unsigne= d long end); extern void dcache_clean_inval_poc(unsigned long start, unsigned long end); extern void dcache_inval_poc(unsigned long start, unsigned long end); extern void dcache_clean_poc(unsigned long start, unsigned long end); +extern void dcache_clean_poc_nosync(unsigned long start, unsigned long end= ); extern void dcache_clean_pop(unsigned long start, unsigned long end); extern void dcache_clean_pou(unsigned long start, unsigned long end); extern long caches_clean_inval_user_pou(unsigned long start, unsigned long= end); diff --git a/arch/arm64/mm/cache.S b/arch/arm64/mm/cache.S index 503567c864fd..4a7c7e03785d 100644 --- a/arch/arm64/mm/cache.S +++ b/arch/arm64/mm/cache.S @@ -178,6 +178,21 @@ SYM_FUNC_START(__pi_dcache_clean_poc) SYM_FUNC_END(__pi_dcache_clean_poc) SYM_FUNC_ALIAS(dcache_clean_poc, __pi_dcache_clean_poc) =20 +/* + * dcache_clean_poc_nosync(start, end) + * + * Issue the instructions of D-cache lines for the interval [start, end). + * not necessarily cleaned to the PoC till an explicit dsb sy afterward. + * + * - start - virtual start address of region + * - end - virtual end address of region + */ +SYM_FUNC_START(__pi_dcache_clean_poc_nosync) + dcache_by_line_op_nosync cvac, x0, x1, x2, x3 + ret +SYM_FUNC_END(__pi_dcache_clean_poc_nosync) +SYM_FUNC_ALIAS(dcache_clean_poc_nosync, __pi_dcache_clean_poc_nosync) + /* * dcache_clean_pop(start, end) * --=20 2.39.3 (Apple Git-146) From nobody Sat Feb 7 23:22:47 2026 Received: from mail-pj1-f48.google.com (mail-pj1-f48.google.com [209.85.216.48]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id C93DA2EAB72 for ; Wed, 29 Oct 2025 02:32:14 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.48 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705136; cv=none; b=GOMsNfSQBX7Zgr5p5CGIydcuZM1XxzW0YhTuP7S89XKGab6nIrTv3V0Oj4a30LCOeIKdIvqxz1R04romLxkGKuioGDAHLZAo6v3PDcDYfYCBRwxyh72kQY7BMrTbp3p4hRrA9DYQZY8ym8HXrjb1MnesFGWduqMDHozNzhA2xJ0= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705136; c=relaxed/simple; bh=hEllaWUDapEMGyxdObch3Q1AUPdojmO2+4mGzAGRdlI=; h=From:To:Cc:Subject:Date:Message-Id:In-Reply-To:References: MIME-Version; b=Wf2xU6bYp6kNS9ak16JKV0cicenFTPh1HfHE66/FTzUXCEG14Ehu6wGYe8zx5PomaA2MNHZyVRYNlE8SotRvYSqJTq8emoGm+g19YrRSfn0M4Zn80Mt75p3qrJdkGSriLYqWBp2VcyrOiWllOE50wMx0lV74l8SM1h9QsOsLBrE= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=Nb85jhTA; arc=none smtp.client-ip=209.85.216.48 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="Nb85jhTA" Received: by mail-pj1-f48.google.com with SMTP id 98e67ed59e1d1-339e71ccf48so8553235a91.3 for ; Tue, 28 Oct 2025 19:32:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1761705134; x=1762309934; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=uUUL7svE/qO1mMkeybYnmgbSMQwYmeEBqJj7PT/8HRw=; b=Nb85jhTAcMPn1vAHM4O7fYWVZBXnZ5EQJK+zquAxjLh9qJcKVA4n9RWn72MOkylWb6 uDVQ7Wk/5dGDeP5QfjsXe1/65hgqxT5pKD7F24zXWYKk9SAAkNrq0AhIVztF69TfWmje eiR+/Ek1skWHKfzV7KNZuy7Y4rOJAtRGnc4B+Gw+sRz3SsegVBSjYNI4pTQfV21gZxbC m+qXUoH6ErWbYh2GIWZRcG7bURNKkc3WbDEXDpByYL575c1XOnPB8pVWAdnGS8QMLHEU 4qvg7EEQPsuoHjEEgNQZ12xy7VqklYmDyHX72WbQajCfa6beBN3F7zpExxMWsq2+QWhs dCJg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1761705134; x=1762309934; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=uUUL7svE/qO1mMkeybYnmgbSMQwYmeEBqJj7PT/8HRw=; b=rStWA+mtulGp5RWFRucNZCDW1E/jV+8ot3aDJoCvirYw0x7iOpnos/mTmaLIhVSZSz qyuV0XOJg+u4zScqX2EEJXWri+l5yFm5ZhDluS0X2vbgANcdFhH2ugrXtMzlBPEzf7qp 5sjc1NmgU7xHuEd45o3+H1Xqc3mvwPOHEP1yDPANAWiaGE9rdWYHWv6e60atU9nBk36I kNK6RMNU5pkq1NCmmbotAnvtl3lp9wTkXcq8cRwXbvOv9XD1khhA+nPSrXLOrWOB9lnX 2tZaCtpDDHkDpEyOlGXN3vZbGaXSjsq+V9wmyT7yMhLF201CGKNRhOKzJ0Ae/jD1ZDrU nn8Q== X-Forwarded-Encrypted: i=1; AJvYcCVi+9HN62Rk7CcTCLPzSnRznJDy3kjSnzLMR/+Id0k9UV8tMCIgJ8rD7zfX0LLI5d687Q3N4Mppi9VyUUw=@vger.kernel.org X-Gm-Message-State: AOJu0YwTR+vwKLx/kebvkC0uPdjaBr8HV4bOXxtjEMgIQPKMJazSzbru UkAJ4ON5T/QrkA5XglHPOBScUStAMksqtZpdyPNXqtyRBiKdcRcreMwx X-Gm-Gg: ASbGncurU6y2cd/poMz0eHIvNVXnAaoQ7+8hoZAMDPLZfWAs2J+GNua/d8pXmBxyM5v 0mTNdKmvGi+bGNiXF7Sk4+FoQG3M5WVa6+tUy4U7A9BqcLhhY5+ExJObr85hktAm2xU1MW/QySw z77Xs9aDcdbvekc051vZS7AgIY2rXT+D9lRo1Es0Mh6PZf4HIpuOC6NjbnVA+e4AGK/1HjQLGMc oOEBBZCsusXONH6urSxjGKp8hxIyPABHRX0CrpM/Y0BnGWfCX4BrDEOM7TxZR2rJZnl56KhpWlV sHoxsVoyb+kBSmUGu5PHsC8A4lFgx0BWAKmAIpzTr5XlEUc8at4z1XSrdzMh6gmDFXS2QbHIxgw x4t036MCnzcxyS37G2zl6OdEdd1M4PLbKeQVy51D6arMiuUos+bqQg6Yivfgjkt0obrwuXRF8lY gSZD0sayPkHHmJhtSGoqElzsX84ChTPBgECIC/07Osu7uNyQMLGNkGHFztrw== X-Google-Smtp-Source: AGHT+IFvR7rogTcm5B+3ygGQw/lQVLBAuEII5+Xvx9q6E8V7+VzLsCmMzzmlIs3QIptMEm30AGJ2Ag== X-Received: by 2002:a17:90b:384e:b0:339:9f7d:92d4 with SMTP id 98e67ed59e1d1-3403a265508mr1486973a91.9.1761705133912; Tue, 28 Oct 2025 19:32:13 -0700 (PDT) Received: from localhost.localdomain ([47.72.128.212]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-33fed81a4afsm13649518a91.19.2025.10.28.19.32.08 (version=TLS1_3 cipher=TLS_CHACHA20_POLY1305_SHA256 bits=256/256); Tue, 28 Oct 2025 19:32:13 -0700 (PDT) From: Barry Song <21cnbao@gmail.com> To: Catalin Marinas , Will Deacon , Marek Szyprowski , Robin Murphy Cc: Barry Song , Ada Couprie Diaz , Ard Biesheuvel , Marc Zyngier , Anshuman Khandual , Ryan Roberts , Suren Baghdasaryan , Tangquan Zheng , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, iommu@lists.linux.dev Subject: [RFC PATCH 3/5] arm64: Provide dcache_inval_poc_nosync helper Date: Wed, 29 Oct 2025 10:31:13 +0800 Message-Id: <20251029023115.22809-4-21cnbao@gmail.com> X-Mailer: git-send-email 2.39.3 (Apple Git-146) In-Reply-To: <20251029023115.22809-1-21cnbao@gmail.com> References: <20251029023115.22809-1-21cnbao@gmail.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" From: Barry Song dcache_inval_poc_nosync does not wait for the data cache invalidation to complete. Later, we defer the synchronization so we can wait for all SG entries together. Cc: Catalin Marinas Cc: Will Deacon Cc: Marek Szyprowski Cc: Robin Murphy Cc: Ada Couprie Diaz Cc: Ard Biesheuvel Cc: Marc Zyngier Cc: Anshuman Khandual Cc: Ryan Roberts Cc: Suren Baghdasaryan Cc: Tangquan Zheng Cc: linux-arm-kernel@lists.infradead.org Cc: linux-kernel@vger.kernel.org Cc: iommu@lists.linux.dev Signed-off-by: Barry Song --- arch/arm64/include/asm/cacheflush.h | 1 + arch/arm64/mm/cache.S | 43 +++++++++++++++++++++-------- 2 files changed, 33 insertions(+), 11 deletions(-) diff --git a/arch/arm64/include/asm/cacheflush.h b/arch/arm64/include/asm/c= acheflush.h index 9b6d0a62cf3d..382b4ac3734d 100644 --- a/arch/arm64/include/asm/cacheflush.h +++ b/arch/arm64/include/asm/cacheflush.h @@ -74,6 +74,7 @@ extern void icache_inval_pou(unsigned long start, unsigne= d long end); extern void dcache_clean_inval_poc(unsigned long start, unsigned long end); extern void dcache_inval_poc(unsigned long start, unsigned long end); extern void dcache_clean_poc(unsigned long start, unsigned long end); +extern void dcache_inval_poc_nosync(unsigned long start, unsigned long end= ); extern void dcache_clean_poc_nosync(unsigned long start, unsigned long end= ); extern void dcache_clean_pop(unsigned long start, unsigned long end); extern void dcache_clean_pou(unsigned long start, unsigned long end); diff --git a/arch/arm64/mm/cache.S b/arch/arm64/mm/cache.S index 4a7c7e03785d..8c1043c9b9e5 100644 --- a/arch/arm64/mm/cache.S +++ b/arch/arm64/mm/cache.S @@ -132,17 +132,7 @@ alternative_else_nop_endif ret SYM_FUNC_END(dcache_clean_pou) =20 -/* - * dcache_inval_poc(start, end) - * - * Ensure that any D-cache lines for the interval [start, end) - * are invalidated. Any partial lines at the ends of the interval are - * also cleaned to PoC to prevent data loss. - * - * - start - kernel start address of region - * - end - kernel end address of region - */ -SYM_FUNC_START(__pi_dcache_inval_poc) +.macro _dcache_inval_poc_impl, do_sync dcache_line_size x2, x3 sub x3, x2, #1 tst x1, x3 // end cache line aligned? @@ -158,11 +148,42 @@ SYM_FUNC_START(__pi_dcache_inval_poc) 3: add x0, x0, x2 cmp x0, x1 b.lo 2b +.if \do_sync dsb sy +.endif ret +.endm + +/* + * dcache_inval_poc(start, end) + * + * Ensure that any D-cache lines for the interval [start, end) + * are invalidated. Any partial lines at the ends of the interval are + * also cleaned to PoC to prevent data loss. + * + * - start - kernel start address of region + * - end - kernel end address of region + */ +SYM_FUNC_START(__pi_dcache_inval_poc) + _dcache_inval_poc_impl 1 SYM_FUNC_END(__pi_dcache_inval_poc) SYM_FUNC_ALIAS(dcache_inval_poc, __pi_dcache_inval_poc) =20 +/* + * dcache_inval_poc_nosync(start, end) + * + * Issue the instructions of D-cache lines for the interval [start, end) + * for invalidation. Not necessarily cleaned to PoC till an explicit dsb + * sy later + * + * - start - kernel start address of region + * - end - kernel end address of region + */ +SYM_FUNC_START(__pi_dcache_inval_poc_nosync) + _dcache_inval_poc_impl 0 +SYM_FUNC_END(__pi_dcache_inval_poc_nosync) +SYM_FUNC_ALIAS(dcache_inval_poc_nosync, __pi_dcache_inval_poc_nosync) + /* * dcache_clean_poc(start, end) * --=20 2.39.3 (Apple Git-146) From nobody Sat Feb 7 23:22:47 2026 Received: from mail-pj1-f46.google.com (mail-pj1-f46.google.com [209.85.216.46]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 298EC1F0994 for ; Wed, 29 Oct 2025 02:32:29 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.46 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705150; cv=none; b=iuWYu27zjuczLxUSj89RPm9B52QGkWknzHb8cr52WDO3LwInWasoSD25Gov0DuP4Dk5pgIvth11wGIPKL1xbT22Oscqh60fmKbRwVB29H2MBuoAf7dDb58yvqiSTxYzra3voVLb8IO5EfgkRNoaZcX7qrAGLAPRufPBuKUJ3HA8= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705150; c=relaxed/simple; bh=kJU0HHsz8RVc7g1I+1f1X0net62Vn5bTl5/+wnElJ50=; h=From:To:Cc:Subject:Date:Message-Id:In-Reply-To:References: MIME-Version; b=iS+XgSR3owQnq39iVOW2bA1kulgI6CJoAqzT+M9yAMI5TJnn1Mtm29bqtwXJJ64SWEGcUVHcUSRD0BKXkaEXZTn8UeAxPzpExxNU3209HFzwxfFscdYsf2E8IoQKV80HId/i0pR3BruPUoYjVey/X7UNI0wbBUgeFDoltZlh4Do= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=O6d/76TP; arc=none smtp.client-ip=209.85.216.46 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="O6d/76TP" Received: by mail-pj1-f46.google.com with SMTP id 98e67ed59e1d1-3304dd2f119so5179612a91.2 for ; Tue, 28 Oct 2025 19:32:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1761705148; x=1762309948; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=EbaG5t0eQSgPX414bWojxTfff0IaUO9MDOrZ/xFSz38=; b=O6d/76TPW/tuiqYjgq8yng7Cu0siXycaf/U+jPR17iX16h7741ee4l3T8D/LHaUWH9 mcWC98loTMeD/e6NcS8c9WW0Ibc0d3mQI3g+VeI+GSkQ49qumN26K/zZHBd7eG2QUKtD CCkdWj8Oqz2SDUYREtMqwQVbKu1bxBYhRX472HChcfBizRohh1C80n94E5jUtD4BplgY sfnwXQLic9Dj7eZGuEEwBPqe4LBAH3UVQOL/kfD3BDTR+nKa0n+wf3ceWwaUvW8UzvRH ODejwwpMLrVrgqst/+yoXnC7RCh/3d7ZQLW91zrwTXIhwkkt+JjbvvuAobjU4/hvD3Gg DCBQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1761705148; x=1762309948; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=EbaG5t0eQSgPX414bWojxTfff0IaUO9MDOrZ/xFSz38=; b=oUq2X4/uqY7zG4Kg+NtgICdKM437Kxf6YlLnNZ/Q2rD613ldkVAj1wtO9zVXzOX7MB RkpP4ogoYOeMx/VToibCyw9GAP4st+8CvA9xNmwXIKB/8KOV0CcDSac7+jEI6zvtD8J7 94VObOOXTHjTmOy44ykRMS9wQt1i90CNOFlXLxl0EFZ5WcVh2e8hk1kbyxCswiERhyRe HmOgrVNqzM1548IDQpqNU5yVZScMipTdweon4WQjZsg6nh8SDXWbOHRtIxW6+QmRWl3R SGvUHtHPOjoP8ASIcC5a89GwsOqyVVNo49a7dxO+EKLJBEDeO8tE+KEtLsx4OC65Aqrn YoXw== X-Forwarded-Encrypted: i=1; AJvYcCW8AUxikXPQD7jmOtimD1b+AxQnjb0XNZpv35DRhzxQjxcR5lyhlXVlrmGHklMXd1YaABvmrIbZbvR5zns=@vger.kernel.org X-Gm-Message-State: AOJu0YxUoX0rMPT94OZITKS1wjraKokqnKGogZnfao9dQqdxrws7F01m IFFYY6fuo9Pati31UEFWqDgFUtDC6ZseC7RIPFcgvZW8IZLLgn21nV4C X-Gm-Gg: ASbGncuu9InDW/IN6wFUZ3nZhx5Fpk+bNvoDSqXF+2MmIvQIkco3rkzRsY+0e7+1rF8 R7M/yxjSbPuLKS10cAIaSH3Q6Yi0772+TPcdG8yT8173r+6RpTm/CSBTKmV9gNQhWYz24IgDF3O fuO+/77RqkyLZjVfJrf1wuJWUIUEMUw336rbuUWxuHJd9AMnxzZi3nwEDUFtp9BAibvq45WYuQw JjGDLYQvcyml0/kgTWsPbb27mj5uht0tR0RpgJpK0L39nu5mqep5sIlNHCo+K2Qb9gWyiatDCph 5w1gtDUPm7us+h8L2EFo2Sc83esEpXXBUMQ2/gONU9FP2L39Uzm5jSnNOPVLbuVX6WB4Hn1gkMe UoIdpY5b2IPd645k4RIWFbggTE5HQd8MCw79yhj6YuNwb0bWXtQES4sC4J2mYc+ysmQjnALPLU5 SPK+sx+7ms4g+QatQmWw9Cnh5HBZlidWeYQDAICGoOX9+2uMme/iVFR03gOg== X-Google-Smtp-Source: AGHT+IHJ74P6d7A+INVwA98f5iJflbVrWJao3lGug3xK3l6qx0+mauB/rdLhpcHFC1X7v1I1yEMgnw== X-Received: by 2002:a17:90b:4a4b:b0:33b:b0d6:6bc2 with SMTP id 98e67ed59e1d1-3403a2f2219mr1197121a91.30.1761705148273; Tue, 28 Oct 2025 19:32:28 -0700 (PDT) Received: from localhost.localdomain ([47.72.128.212]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-33fed81a4afsm13649518a91.19.2025.10.28.19.32.22 (version=TLS1_3 cipher=TLS_CHACHA20_POLY1305_SHA256 bits=256/256); Tue, 28 Oct 2025 19:32:27 -0700 (PDT) From: Barry Song <21cnbao@gmail.com> To: Catalin Marinas , Will Deacon , Marek Szyprowski , Robin Murphy Cc: Barry Song , Ada Couprie Diaz , Ard Biesheuvel , Marc Zyngier , Anshuman Khandual , Ryan Roberts , Suren Baghdasaryan , Tangquan Zheng , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, iommu@lists.linux.dev Subject: [RFC PATCH 4/5] arm64: Provide arch_sync_dma_ batched helpers Date: Wed, 29 Oct 2025 10:31:14 +0800 Message-Id: <20251029023115.22809-5-21cnbao@gmail.com> X-Mailer: git-send-email 2.39.3 (Apple Git-146) In-Reply-To: <20251029023115.22809-1-21cnbao@gmail.com> References: <20251029023115.22809-1-21cnbao@gmail.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" From: Barry Song arch_sync_dma_for_device_batch_add() and arch_sync_dma_for_cpu_batch_add() batch the DMA sync operations, and arch_sync_dma_batch_flush() waits for their completion all together. Cc: Catalin Marinas Cc: Will Deacon Cc: Marek Szyprowski Cc: Robin Murphy Cc: Ada Couprie Diaz Cc: Ard Biesheuvel Cc: Marc Zyngier Cc: Anshuman Khandual Cc: Ryan Roberts Cc: Suren Baghdasaryan Cc: Tangquan Zheng Cc: linux-arm-kernel@lists.infradead.org Cc: linux-kernel@vger.kernel.org Cc: iommu@lists.linux.dev Signed-off-by: Barry Song --- arch/arm64/Kconfig | 1 + arch/arm64/mm/dma-mapping.c | 24 ++++++++++++++++++++++++ include/linux/dma-map-ops.h | 8 ++++++++ kernel/dma/Kconfig | 3 +++ 4 files changed, 36 insertions(+) diff --git a/arch/arm64/Kconfig b/arch/arm64/Kconfig index 6663ffd23f25..1ecf8a1c2458 100644 --- a/arch/arm64/Kconfig +++ b/arch/arm64/Kconfig @@ -112,6 +112,7 @@ config ARM64 select ARCH_SUPPORTS_SCHED_CLUSTER select ARCH_SUPPORTS_SCHED_MC select ARCH_WANT_BATCHED_UNMAP_TLB_FLUSH + select ARCH_WANT_BATCHED_DMA_SYNC select ARCH_WANT_COMPAT_IPC_PARSE_VERSION if COMPAT select ARCH_WANT_DEFAULT_BPF_JIT select ARCH_WANT_DEFAULT_TOPDOWN_MMAP_LAYOUT diff --git a/arch/arm64/mm/dma-mapping.c b/arch/arm64/mm/dma-mapping.c index b2b5792b2caa..9ac1ddd1bb9c 100644 --- a/arch/arm64/mm/dma-mapping.c +++ b/arch/arm64/mm/dma-mapping.c @@ -31,6 +31,30 @@ void arch_sync_dma_for_cpu(phys_addr_t paddr, size_t siz= e, dcache_inval_poc(start, start + size); } =20 +void arch_sync_dma_for_device_batch_add(phys_addr_t paddr, size_t size, + enum dma_data_direction dir) +{ + unsigned long start =3D (unsigned long)phys_to_virt(paddr); + + dcache_clean_poc_nosync(start, start + size); +} + +void arch_sync_dma_for_cpu_batch_add(phys_addr_t paddr, size_t size, + enum dma_data_direction dir) +{ + unsigned long start =3D (unsigned long)phys_to_virt(paddr); + + if (dir =3D=3D DMA_TO_DEVICE) + return; + + dcache_inval_poc_nosync(start, start + size); +} + +void arch_sync_dma_batch_flush(void) +{ + dsb(sy); +} + void arch_dma_prep_coherent(struct page *page, size_t size) { unsigned long start =3D (unsigned long)page_address(page); diff --git a/include/linux/dma-map-ops.h b/include/linux/dma-map-ops.h index 10882d00cb17..8fcd0a9c1f39 100644 --- a/include/linux/dma-map-ops.h +++ b/include/linux/dma-map-ops.h @@ -367,6 +367,14 @@ static inline void arch_sync_dma_for_cpu(phys_addr_t p= addr, size_t size, } #endif /* ARCH_HAS_SYNC_DMA_FOR_CPU */ =20 +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC +void arch_sync_dma_for_device_batch_add(phys_addr_t paddr, size_t size, + enum dma_data_direction dir); +void arch_sync_dma_for_cpu_batch_add(phys_addr_t paddr, size_t size, + enum dma_data_direction dir); +void arch_sync_dma_batch_flush(void); +#endif + #ifdef CONFIG_ARCH_HAS_SYNC_DMA_FOR_CPU_ALL void arch_sync_dma_for_cpu_all(void); #else diff --git a/kernel/dma/Kconfig b/kernel/dma/Kconfig index 31cfdb6b4bc3..2785099b2fa0 100644 --- a/kernel/dma/Kconfig +++ b/kernel/dma/Kconfig @@ -78,6 +78,9 @@ config ARCH_HAS_DMA_PREP_COHERENT config ARCH_HAS_FORCE_DMA_UNENCRYPTED bool =20 +config ARCH_WANT_BATCHED_DMA_SYNC + bool + # # Select this option if the architecture assumes DMA devices are coherent # by default. --=20 2.39.3 (Apple Git-146) From nobody Sat Feb 7 23:22:47 2026 Received: from mail-pj1-f41.google.com (mail-pj1-f41.google.com [209.85.216.41]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 46C49262D0C for ; Wed, 29 Oct 2025 02:32:39 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.41 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705161; cv=none; b=TzvstNN5KXFkZQhLzBMm8hIuSbItI5z6E0OmkA7N/onG5iO+2YysaCiBlYolv9XPth3Cv2KzM36TQZqdvHEmTOiiRPfmmF3gHMiT0SbVJ+8HOnloIblzGIjbAGvGwvcZPrE1xsQIx848C2kI5ZWcOJJK6/HQRGRCCWI+WJsMIU8= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761705161; c=relaxed/simple; bh=FhMdW1amZrDZOTYi1F2Y9ii4zVzoBonKSxbmPFkN9mo=; h=From:To:Cc:Subject:Date:Message-Id:In-Reply-To:References: MIME-Version; b=qmBSNQfuX1d7fKBiZjA2ImCpnnMZ/3IDMjrbTXYrshWb9cb1GPXMR/5OGAsVZ1DKL6Jzc3vwkjEKx84FMYmqsSMYHn4JYAC8lIYNxGr4lKNQuBep8iJ/KTZExfszFHpz6Gr/2HT9YIrvC6amQWdvOtoK3HWEmVgZYY5uknoExhA= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=lBXiuaGT; arc=none smtp.client-ip=209.85.216.41 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="lBXiuaGT" Received: by mail-pj1-f41.google.com with SMTP id 98e67ed59e1d1-33d7589774fso6412755a91.0 for ; Tue, 28 Oct 2025 19:32:39 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1761705159; x=1762309959; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=JXLeKP/DtOqQuL8m7vYar3R6RP6+fCAYk9I78/ihr2U=; b=lBXiuaGT54hz97+RNCy79FUgcfTEPw0U2nLD7NiKjdaqflviYv63UugwI8eGAs12Dv uHMLfyGH4I1waQCefV00arNVB/XFth6C/Inmcw0G62ffvyIsLOvykiRJwduDjw6D3OrW J3zaW5q2le7b8zU+kI/BWtOpk4ct6ONWp8Y9VeA9HS+ycu7fEOP7oP7/hxlhNAPxK1eB aVxcSxLrogRcHJYzT2hrCFCqKpWp7ylKltUtEGtET681hotNjLbSASwkFz0NeeAfmtYp FRS3qcsNHt3Wr/r2rozzyINdOqEsbWe4ONG9gfOTXmL95M7lIwFzVGlm6HPtWHwlkO4O eFBw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1761705159; x=1762309959; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=JXLeKP/DtOqQuL8m7vYar3R6RP6+fCAYk9I78/ihr2U=; b=mVxUv5dGo46fzkxbaUm8o3AQ0dm4YSFp82SQVemQD1MpehA7X8BxrG9WN02YwtJRu4 7tiYcWE+4jCoeOsA+VsY3+h7v2UE6FgokY4dVbbpsg2BdisgM/XBll8VxgvjUjFRpqbL EyS1QBWbmieGPGtQxFjgbV6NHnI+LKx4RFhyKjVAKVgszIvQ7rYepu6mMamADBQRDAcp hLkWJf2XOEdPFoO/68xwqizD3xe8VPmojvv9jYUTU1MGzJFaCahJ1n/KflhQ7Y7hkpjE Tm05GPruI6ETgJJGZbr0A0LNTsrxMdki66le2K1Vkl+UhfOujX9Te6FpF+hWYBfFNEHS Xygw== X-Forwarded-Encrypted: i=1; AJvYcCUciPkppnrRJzF1aBxrk3NYSAQoMtQEr64TUG4x3h8+dkb30D0c4j/X5Ly7S+iuNDBC2OOV2CyAFMpOB4c=@vger.kernel.org X-Gm-Message-State: AOJu0YycV2w5awUxv1drT94jv75+0cdar08uSESUMPurPFl5wRAYeD8N e1o7ziD656rXRrAbQjCdRzOPIkjDyexb6a+g+Wzzy7C2VHOPreNikDMz X-Gm-Gg: ASbGncuaPhnsO6dlnWl1xviyLdGCYfZs2OSdVYbp64aHoz1ZwF2G3htQKdVx/F0buyD FlTCBMmamA70huucHYsqSvxcNCUNb+exLSqz2HGuA9kRGcRLSW65J1eUyVOlmgPn0T/ev+jfTjj dgVDb4SHVutp9vurQcP9+N3P3X1D6OTj6WRSJhnX7hgqEEndtvQyibcrfz91gEoQLYsSN6Q94xj 0ColeaNfEZaQoqZ7iWpVGvSq/50gLEVe+U0UqluTgb/FaYqhi9JEa/CBKYEp9f6moS/3xYUOdWV FgX/BCd93E0qTNc0QT64ZEEILRD9kvy4K7d3yZyjsxMCT4eC4s/DU6yKj4RQfdDnVemq6Yj+l5i BusVC091XLaLLAoece7Gjgpst5lJKaG6hm2HjwmAG52vIAekw6LU0etOcxfxsfPD8IbK6MY2LoP O16jqmnIsxstrSnsQDJeTGWeNXVczZF4uLSoCTZLoCiSNXCfOOutx0ZTZeoA== X-Google-Smtp-Source: AGHT+IFhBNLyIOct1mE9x8NjCrvkWR0gN9M9H3c4aZoq5oEbAmNrnFsNADSTOspd44npw8jvwAKgSg== X-Received: by 2002:a17:90b:2549:b0:33f:ee05:56e1 with SMTP id 98e67ed59e1d1-3403a143527mr1244236a91.2.1761705159331; Tue, 28 Oct 2025 19:32:39 -0700 (PDT) Received: from localhost.localdomain ([47.72.128.212]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-33fed81a4afsm13649518a91.19.2025.10.28.19.32.33 (version=TLS1_3 cipher=TLS_CHACHA20_POLY1305_SHA256 bits=256/256); Tue, 28 Oct 2025 19:32:38 -0700 (PDT) From: Barry Song <21cnbao@gmail.com> To: Catalin Marinas , Will Deacon , Marek Szyprowski , Robin Murphy Cc: Barry Song , Ada Couprie Diaz , Ard Biesheuvel , Marc Zyngier , Anshuman Khandual , Ryan Roberts , Suren Baghdasaryan , Tangquan Zheng , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, iommu@lists.linux.dev Subject: [RFC PATCH 5/5] dma-mapping: Allow batched DMA sync operations if supported by the arch Date: Wed, 29 Oct 2025 10:31:15 +0800 Message-Id: <20251029023115.22809-6-21cnbao@gmail.com> X-Mailer: git-send-email 2.39.3 (Apple Git-146) In-Reply-To: <20251029023115.22809-1-21cnbao@gmail.com> References: <20251029023115.22809-1-21cnbao@gmail.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" From: Barry Song This enables dma_direct_sync_sg_for_device, dma_direct_sync_sg_for_cpu, dma_direct_map_sg, and dma_direct_unmap_sg to use batched DMA sync operations when possible. This significantly improves performance on devices without hardware cache coherence. Tangquan's initial results show that batched synchronization can reduce dma_map_sg() time by 64.61% and dma_unmap_sg() time by 66.60% on an MTK phone platform (MediaTek Dimensity 9500). The tests were performed by pinning the task to CPU7 and fixing the CPU frequency at 2.6 GHz, running dma_map_sg() and dma_unmap_sg() on 10 MB buffers (10 MB / 4 KB sg entries per buffer) for 200 iterations and then averaging the results. Cc: Catalin Marinas Cc: Will Deacon Cc: Marek Szyprowski Cc: Robin Murphy Cc: Ada Couprie Diaz Cc: Ard Biesheuvel Cc: Marc Zyngier Cc: Anshuman Khandual Cc: Ryan Roberts Cc: Suren Baghdasaryan Cc: Tangquan Zheng Cc: linux-arm-kernel@lists.infradead.org Cc: linux-kernel@vger.kernel.org Cc: iommu@lists.linux.dev Signed-off-by: Barry Song --- kernel/dma/direct.c | 53 +++++++++++++++++++++++++--- kernel/dma/direct.h | 86 +++++++++++++++++++++++++++++++++++++++------ 2 files changed, 123 insertions(+), 16 deletions(-) diff --git a/kernel/dma/direct.c b/kernel/dma/direct.c index 1f9ee9759426..a0b45f84a91f 100644 --- a/kernel/dma/direct.c +++ b/kernel/dma/direct.c @@ -403,9 +403,16 @@ void dma_direct_sync_sg_for_device(struct device *dev, swiotlb_sync_single_for_device(dev, paddr, sg->length, dir); =20 if (!dev_is_dma_coherent(dev)) - arch_sync_dma_for_device(paddr, sg->length, - dir); +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + arch_sync_dma_for_device_batch_add(paddr, sg->length, dir); +#else + arch_sync_dma_for_device(paddr, sg->length, dir); +#endif } +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + if (!dev_is_dma_coherent(dev)) + arch_sync_dma_batch_flush(); +#endif } #endif =20 @@ -422,7 +429,11 @@ void dma_direct_sync_sg_for_cpu(struct device *dev, phys_addr_t paddr =3D dma_to_phys(dev, sg_dma_address(sg)); =20 if (!dev_is_dma_coherent(dev)) +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + arch_sync_dma_for_cpu_batch_add(paddr, sg->length, dir); +#else arch_sync_dma_for_cpu(paddr, sg->length, dir); +#endif =20 swiotlb_sync_single_for_cpu(dev, paddr, sg->length, dir); =20 @@ -430,8 +441,12 @@ void dma_direct_sync_sg_for_cpu(struct device *dev, arch_dma_mark_clean(paddr, sg->length); } =20 - if (!dev_is_dma_coherent(dev)) + if (!dev_is_dma_coherent(dev)) { arch_sync_dma_for_cpu_all(); +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + arch_sync_dma_batch_flush(); +#endif + } } =20 /* @@ -443,14 +458,29 @@ void dma_direct_unmap_sg(struct device *dev, struct s= catterlist *sgl, { struct scatterlist *sg; int i; +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + bool need_sync =3D false; +#endif =20 for_each_sg(sgl, sg, nents, i) { - if (sg_dma_is_bus_address(sg)) + if (sg_dma_is_bus_address(sg)) { sg_dma_unmark_bus_address(sg); - else + } else { +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + need_sync =3D true; + dma_direct_unmap_phys_batch_add(dev, sg->dma_address, + sg_dma_len(sg), dir, attrs); + +#else dma_direct_unmap_phys(dev, sg->dma_address, sg_dma_len(sg), dir, attrs); +#endif + } } +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + if (need_sync && !dev_is_dma_coherent(dev)) + arch_sync_dma_batch_flush(); +#endif } #endif =20 @@ -460,6 +490,9 @@ int dma_direct_map_sg(struct device *dev, struct scatte= rlist *sgl, int nents, struct pci_p2pdma_map_state p2pdma_state =3D {}; struct scatterlist *sg; int i, ret; +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + bool need_sync =3D false; +#endif =20 for_each_sg(sgl, sg, nents, i) { switch (pci_p2pdma_state(&p2pdma_state, dev, sg_page(sg))) { @@ -471,8 +504,14 @@ int dma_direct_map_sg(struct device *dev, struct scatt= erlist *sgl, int nents, */ break; case PCI_P2PDMA_MAP_NONE: +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + need_sync =3D true; + sg->dma_address =3D dma_direct_map_phys_batch_add(dev, sg_phys(sg), + sg->length, dir, attrs); +#else sg->dma_address =3D dma_direct_map_phys(dev, sg_phys(sg), sg->length, dir, attrs); +#endif if (sg->dma_address =3D=3D DMA_MAPPING_ERROR) { ret =3D -EIO; goto out_unmap; @@ -490,6 +529,10 @@ int dma_direct_map_sg(struct device *dev, struct scatt= erlist *sgl, int nents, sg_dma_len(sg) =3D sg->length; } =20 +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC + if (need_sync && !dev_is_dma_coherent(dev)) + arch_sync_dma_batch_flush(); +#endif return nents; =20 out_unmap: diff --git a/kernel/dma/direct.h b/kernel/dma/direct.h index da2fadf45bcd..a211bab26478 100644 --- a/kernel/dma/direct.h +++ b/kernel/dma/direct.h @@ -64,15 +64,11 @@ static inline void dma_direct_sync_single_for_device(st= ruct device *dev, arch_sync_dma_for_device(paddr, size, dir); } =20 -static inline void dma_direct_sync_single_for_cpu(struct device *dev, - dma_addr_t addr, size_t size, enum dma_data_direction dir) +static inline void __dma_direct_sync_single_for_cpu(struct device *dev, + phys_addr_t paddr, size_t size, enum dma_data_direction dir) { - phys_addr_t paddr =3D dma_to_phys(dev, addr); - - if (!dev_is_dma_coherent(dev)) { - arch_sync_dma_for_cpu(paddr, size, dir); + if (!dev_is_dma_coherent(dev)) arch_sync_dma_for_cpu_all(); - } =20 swiotlb_sync_single_for_cpu(dev, paddr, size, dir); =20 @@ -80,7 +76,31 @@ static inline void dma_direct_sync_single_for_cpu(struct= device *dev, arch_dma_mark_clean(paddr, size); } =20 -static inline dma_addr_t dma_direct_map_phys(struct device *dev, +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC +static inline void dma_direct_sync_single_for_cpu_batch_add(struct device = *dev, + dma_addr_t addr, size_t size, enum dma_data_direction dir) +{ + phys_addr_t paddr =3D dma_to_phys(dev, addr); + + if (!dev_is_dma_coherent(dev)) + arch_sync_dma_for_cpu_batch_add(paddr, size, dir); + + __dma_direct_sync_single_for_cpu(dev, paddr, size, dir); +} +#endif + +static inline void dma_direct_sync_single_for_cpu(struct device *dev, + dma_addr_t addr, size_t size, enum dma_data_direction dir) +{ + phys_addr_t paddr =3D dma_to_phys(dev, addr); + + if (!dev_is_dma_coherent(dev)) + arch_sync_dma_for_cpu(paddr, size, dir); + + __dma_direct_sync_single_for_cpu(dev, paddr, size, dir); +} + +static inline dma_addr_t __dma_direct_map_phys(struct device *dev, phys_addr_t phys, size_t size, enum dma_data_direction dir, unsigned long attrs) { @@ -108,9 +128,6 @@ static inline dma_addr_t dma_direct_map_phys(struct dev= ice *dev, } } =20 - if (!dev_is_dma_coherent(dev) && - !(attrs & (DMA_ATTR_SKIP_CPU_SYNC | DMA_ATTR_MMIO))) - arch_sync_dma_for_device(phys, size, dir); return dma_addr; =20 err_overflow: @@ -121,6 +138,53 @@ static inline dma_addr_t dma_direct_map_phys(struct de= vice *dev, return DMA_MAPPING_ERROR; } =20 +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC +static inline dma_addr_t dma_direct_map_phys_batch_add(struct device *dev, + phys_addr_t phys, size_t size, enum dma_data_direction dir, + unsigned long attrs) +{ + dma_addr_t dma_addr =3D __dma_direct_map_phys(dev, phys, size, dir, attrs= ); + + if (dma_addr !=3D DMA_MAPPING_ERROR && !dev_is_dma_coherent(dev) && + !(attrs & (DMA_ATTR_SKIP_CPU_SYNC | DMA_ATTR_MMIO))) + arch_sync_dma_for_device_batch_add(phys, size, dir); + + return dma_addr; +} +#endif + +static inline dma_addr_t dma_direct_map_phys(struct device *dev, + phys_addr_t phys, size_t size, enum dma_data_direction dir, + unsigned long attrs) +{ + dma_addr_t dma_addr =3D __dma_direct_map_phys(dev, phys, size, dir, attrs= ); + + if (dma_addr !=3D DMA_MAPPING_ERROR && !dev_is_dma_coherent(dev) && + !(attrs & (DMA_ATTR_SKIP_CPU_SYNC | DMA_ATTR_MMIO))) + arch_sync_dma_for_device(phys, size, dir); + + return dma_addr; +} + +#ifdef CONFIG_ARCH_WANT_BATCHED_DMA_SYNC +static inline void dma_direct_unmap_phys_batch_add(struct device *dev, dma= _addr_t addr, + size_t size, enum dma_data_direction dir, unsigned long attrs) +{ + phys_addr_t phys; + + if (attrs & DMA_ATTR_MMIO) + /* nothing to do: uncached and no swiotlb */ + return; + + phys =3D dma_to_phys(dev, addr); + if (!(attrs & DMA_ATTR_SKIP_CPU_SYNC)) + dma_direct_sync_single_for_cpu_batch_add(dev, addr, size, dir); + + swiotlb_tbl_unmap_single(dev, phys, size, dir, + attrs | DMA_ATTR_SKIP_CPU_SYNC); +} +#endif + static inline void dma_direct_unmap_phys(struct device *dev, dma_addr_t ad= dr, size_t size, enum dma_data_direction dir, unsigned long attrs) { --=20 2.39.3 (Apple Git-146)