From nobody Fri Jan 9 07:51:05 2026 Received: from mail-pj1-f41.google.com (mail-pj1-f41.google.com [209.85.216.41]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 7AB9027CCE0 for ; Fri, 26 Dec 2025 22:54:06 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.41 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1766789649; cv=none; b=RVHsbYRrHwtMGVwiZG8kOTpinBxm1TFM+8sSoz8H/SPwpNMw388YxLmQmMEbBabtS8qPvjDFmO5xKznAKuUePzWX3IXUgI4KdH/DZheZBWj4rv6lMndlHa2GZplKOnjAP1sZV6ZcQpX+XmsTbaJFLQb3nGVpn7sYwNPME6oSh7I= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1766789649; c=relaxed/simple; bh=y0GAlTukBhrwG28eorWp2ff1UapqeGDpOXnThHTS++U=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=XZvq+iOjPoUAcy9SuES3m4udfLBuxSJiy7LYkIwbFPNHJVjh1+A388MKE5HOIip8WE5/sUsL645CBJHYKeXXmblAdDpLbFURV8ga3/r7mx1Xi5p46/oB9H2Y9Fd1eej/ne7dY9cadcLgNzAWupW/jaDeE0lSh+OuXR/Hc4Lu7WM= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=QIrXQYVv; arc=none smtp.client-ip=209.85.216.41 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="QIrXQYVv" Received: by mail-pj1-f41.google.com with SMTP id 98e67ed59e1d1-34c2f335681so5787221a91.1 for ; Fri, 26 Dec 2025 14:54:06 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1766789645; x=1767394445; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=pcGdIc/9DJZ3V/5WVf7vUUbl5wTlCyMzFQWvUiTQ7kw=; b=QIrXQYVvSYtGXv5Y0DdvxVGajEHwy61NH/2Rt1UPnY4iwBAzadsIy9guUbStWgC1xN BoMucDZ4xr+lC6MW4b+pKwwCiwUTNzIKoufco+mSC/vqyiflGdDBLIeu/TFRCGUSKJMd 2G0BfwjVp7+DSGoQAmWCAvXGL81HlfrD3S/whZN2AizTztHULJFM12uBzqwA4JEwXyC7 Xctp2qba2IeVQlLwyhg9XmDmyqEF6gfgiBSsDbp418AJnQqUacKZKd4yGtsWlqHaLhzG 90J/I/4ZOAS3VS05K63IFQldtE9y9W3ouHD9z0K89SHEX8x5fU8j/q8yEWvVp1Ayj1pE WUlw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1766789645; x=1767394445; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-gg:x-gm-message-state:from :to:cc:subject:date:message-id:reply-to; bh=pcGdIc/9DJZ3V/5WVf7vUUbl5wTlCyMzFQWvUiTQ7kw=; b=gpwdoSh7jZUEPzZApeWrzSHgK4PhIDCFJ1CPIibQWoLcEDpahuSjl7CslwqiZbj4uK jONp8n7JliJHeYEZpDI8ZPiDu6CTna9c24a/AwvFz2Wx+oWLuNC5ALnDtn2WLp1ODW0O k4kYqfgdlBOaKsj9qvmz/GT5MaFkLYF8vD6Mf0TD1/es9xSH1vMPFkFyXMLfZmq+rFyJ 3+cCw41JpTtqXhTA1Q8rz9AQn2WmGcVtEIBGOMegNTmdq1BadLBNniPrA+fk1w+2g1LP ERAGxi2x8vCHipDK8ysHqf3zkEF/+GrnisNfVf3p48w7cx+hJ8xEENvwIPEojXwrVuaz L+UQ== X-Gm-Message-State: AOJu0YyyOPEmeb71bqontPSa3/iIbR4qUyz1LopMtdVTKjZGfChmsHwF /wGDTIZFt5vsnpc5HKcK+eaL8CaGBTZH4zeJd5gnVRsneAYA/+AEmVzG X-Gm-Gg: AY/fxX6QGTESOiFnHn+g0Nr/QcOxaqzAnXYFYddv2cI7c0DdlTkAhfe6gOui+OLFR/b NU9XT2kS2r6Htu2uEV57BpcdpGCKwBIQ8HCQhN6rz1yW46c8Da7hKuRMMXw19RpUulc/KbcFbAJ 6F12kVGxFq9GKLUKio+v8ozjAI6OrjwIgw9KtScuk93isEniRwjAifvLTqdoV0K0bLHv7q4YVNq YPYIGT8Js9VnStNYctSwcjdbswqT1QqQc+sSsuvzZPcToWYp1BiBls1xs4jkwlWHH6a7VbF1pFx psyr/I5EAtce5ZCvr+LpVF7L0yjUA5HeA2hf8eMHnpogZmZs1r8FPvw5uPtravAmr2ks5Ye/kSn ZP1qDgDIYDq3ySy96QF/7As9EUJReD+MaBaaViXbjcey2IMXN7IWdVZRPPKZ0as8l/kARbgbkK3 kPOZ1WfLcn45k8 X-Google-Smtp-Source: AGHT+IE+oJ1Ft1XOQ1XU6OeA4rerw4thrh1Pz4yYg+rxMS7HW6pD48/QipbV6W5OVknyxXIG+nvg1w== X-Received: by 2002:a17:90b:5804:b0:34a:b459:bd10 with SMTP id 98e67ed59e1d1-34e921bf172mr21139744a91.24.1766789644533; Fri, 26 Dec 2025 14:54:04 -0800 (PST) Received: from barry-desktop.hub ([47.72.129.29]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-34e772ac1acsm9981428a91.9.2025.12.26.14.53.57 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 26 Dec 2025 14:54:03 -0800 (PST) From: Barry Song <21cnbao@gmail.com> To: catalin.marinas@arm.com, m.szyprowski@samsung.com, robin.murphy@arm.com, will@kernel.org, iommu@lists.linux.dev, linux-arm-kernel@lists.infradead.org Cc: linux-kernel@vger.kernel.org, xen-devel@lists.xenproject.org, Barry Song , Leon Romanovsky , Ada Couprie Diaz , Ard Biesheuvel , Marc Zyngier , Anshuman Khandual , Ryan Roberts , Suren Baghdasaryan , Tangquan Zheng Subject: [PATCH v2 6/8] dma-mapping: Support batch mode for dma_direct_{map,unmap}_sg Date: Sat, 27 Dec 2025 11:52:46 +1300 Message-ID: <20251226225254.46197-7-21cnbao@gmail.com> X-Mailer: git-send-email 2.48.1 In-Reply-To: <20251226225254.46197-1-21cnbao@gmail.com> References: <20251226225254.46197-1-21cnbao@gmail.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" From: Barry Song Leon suggested extending a flush argument to dma_direct_unmap_phys(), dma_direct_map_phys(), and dma_direct_sync_single_for_cpu(). For single-buffer cases, this would use flush=3Dtrue, while for SG cases flush=3Dfalse would be used, followed by a single flush after all cache operations are issued in dma_direct_{map,unmap}_sg(). This ultimately benefits dma_map_sg() and dma_unmap_sg(). Cc: Leon Romanovsky Cc: Catalin Marinas Cc: Will Deacon Cc: Marek Szyprowski Cc: Robin Murphy Cc: Ada Couprie Diaz Cc: Ard Biesheuvel Cc: Marc Zyngier Cc: Anshuman Khandual Cc: Ryan Roberts Cc: Suren Baghdasaryan Cc: Tangquan Zheng Signed-off-by: Barry Song Reviewed-by: Leon Romanovsky --- kernel/dma/direct.c | 17 +++++++++++++---- kernel/dma/direct.h | 16 ++++++++++------ kernel/dma/mapping.c | 6 +++--- 3 files changed, 26 insertions(+), 13 deletions(-) diff --git a/kernel/dma/direct.c b/kernel/dma/direct.c index 98bacf562ca1..550a1a13148d 100644 --- a/kernel/dma/direct.c +++ b/kernel/dma/direct.c @@ -447,14 +447,19 @@ void dma_direct_unmap_sg(struct device *dev, struct s= catterlist *sgl, { struct scatterlist *sg; int i; + bool need_sync =3D false; =20 for_each_sg(sgl, sg, nents, i) { - if (sg_dma_is_bus_address(sg)) + if (sg_dma_is_bus_address(sg)) { sg_dma_unmark_bus_address(sg); - else + } else { + need_sync =3D true; dma_direct_unmap_phys(dev, sg->dma_address, - sg_dma_len(sg), dir, attrs); + sg_dma_len(sg), dir, attrs, false); + } } + if (need_sync && !dev_is_dma_coherent(dev)) + arch_sync_dma_flush(); } #endif =20 @@ -464,6 +469,7 @@ int dma_direct_map_sg(struct device *dev, struct scatte= rlist *sgl, int nents, struct pci_p2pdma_map_state p2pdma_state =3D {}; struct scatterlist *sg; int i, ret; + bool need_sync =3D false; =20 for_each_sg(sgl, sg, nents, i) { switch (pci_p2pdma_state(&p2pdma_state, dev, sg_page(sg))) { @@ -475,8 +481,9 @@ int dma_direct_map_sg(struct device *dev, struct scatte= rlist *sgl, int nents, */ break; case PCI_P2PDMA_MAP_NONE: + need_sync =3D true; sg->dma_address =3D dma_direct_map_phys(dev, sg_phys(sg), - sg->length, dir, attrs); + sg->length, dir, attrs, false); if (sg->dma_address =3D=3D DMA_MAPPING_ERROR) { ret =3D -EIO; goto out_unmap; @@ -495,6 +502,8 @@ int dma_direct_map_sg(struct device *dev, struct scatte= rlist *sgl, int nents, sg_dma_len(sg) =3D sg->length; } =20 + if (need_sync && !dev_is_dma_coherent(dev)) + arch_sync_dma_flush(); return nents; =20 out_unmap: diff --git a/kernel/dma/direct.h b/kernel/dma/direct.h index a69326eed266..d4ad79828090 100644 --- a/kernel/dma/direct.h +++ b/kernel/dma/direct.h @@ -67,13 +67,15 @@ static inline void dma_direct_sync_single_for_device(st= ruct device *dev, } =20 static inline void dma_direct_sync_single_for_cpu(struct device *dev, - dma_addr_t addr, size_t size, enum dma_data_direction dir) + dma_addr_t addr, size_t size, enum dma_data_direction dir, + bool flush) { phys_addr_t paddr =3D dma_to_phys(dev, addr); =20 if (!dev_is_dma_coherent(dev)) { arch_sync_dma_for_cpu(paddr, size, dir); - arch_sync_dma_flush(); + if (flush) + arch_sync_dma_flush(); arch_sync_dma_for_cpu_all(); } =20 @@ -85,7 +87,7 @@ static inline void dma_direct_sync_single_for_cpu(struct = device *dev, =20 static inline dma_addr_t dma_direct_map_phys(struct device *dev, phys_addr_t phys, size_t size, enum dma_data_direction dir, - unsigned long attrs) + unsigned long attrs, bool flush) { dma_addr_t dma_addr; =20 @@ -114,7 +116,8 @@ static inline dma_addr_t dma_direct_map_phys(struct dev= ice *dev, if (!dev_is_dma_coherent(dev) && !(attrs & (DMA_ATTR_SKIP_CPU_SYNC | DMA_ATTR_MMIO))) { arch_sync_dma_for_device(phys, size, dir); - arch_sync_dma_flush(); + if (flush) + arch_sync_dma_flush(); } return dma_addr; =20 @@ -127,7 +130,8 @@ static inline dma_addr_t dma_direct_map_phys(struct dev= ice *dev, } =20 static inline void dma_direct_unmap_phys(struct device *dev, dma_addr_t ad= dr, - size_t size, enum dma_data_direction dir, unsigned long attrs) + size_t size, enum dma_data_direction dir, unsigned long attrs, + bool flush) { phys_addr_t phys; =20 @@ -137,7 +141,7 @@ static inline void dma_direct_unmap_phys(struct device = *dev, dma_addr_t addr, =20 phys =3D dma_to_phys(dev, addr); if (!(attrs & DMA_ATTR_SKIP_CPU_SYNC)) - dma_direct_sync_single_for_cpu(dev, addr, size, dir); + dma_direct_sync_single_for_cpu(dev, addr, size, dir, flush); =20 swiotlb_tbl_unmap_single(dev, phys, size, dir, attrs | DMA_ATTR_SKIP_CPU_SYNC); diff --git a/kernel/dma/mapping.c b/kernel/dma/mapping.c index 37163eb49f9f..d8cfa56a3cbb 100644 --- a/kernel/dma/mapping.c +++ b/kernel/dma/mapping.c @@ -166,7 +166,7 @@ dma_addr_t dma_map_phys(struct device *dev, phys_addr_t= phys, size_t size, =20 if (dma_map_direct(dev, ops) || (!is_mmio && arch_dma_map_phys_direct(dev, phys + size))) - addr =3D dma_direct_map_phys(dev, phys, size, dir, attrs); + addr =3D dma_direct_map_phys(dev, phys, size, dir, attrs, true); else if (use_dma_iommu(dev)) addr =3D iommu_dma_map_phys(dev, phys, size, dir, attrs); else if (ops->map_phys) @@ -207,7 +207,7 @@ void dma_unmap_phys(struct device *dev, dma_addr_t addr= , size_t size, BUG_ON(!valid_dma_direction(dir)); if (dma_map_direct(dev, ops) || (!is_mmio && arch_dma_unmap_phys_direct(dev, addr + size))) - dma_direct_unmap_phys(dev, addr, size, dir, attrs); + dma_direct_unmap_phys(dev, addr, size, dir, attrs, true); else if (use_dma_iommu(dev)) iommu_dma_unmap_phys(dev, addr, size, dir, attrs); else if (ops->unmap_phys) @@ -373,7 +373,7 @@ void __dma_sync_single_for_cpu(struct device *dev, dma_= addr_t addr, size_t size, =20 BUG_ON(!valid_dma_direction(dir)); if (dma_map_direct(dev, ops)) - dma_direct_sync_single_for_cpu(dev, addr, size, dir); + dma_direct_sync_single_for_cpu(dev, addr, size, dir, true); else if (use_dma_iommu(dev)) iommu_dma_sync_single_for_cpu(dev, addr, size, dir); else if (ops->sync_single_for_cpu) --=20 2.43.0