From nobody Wed Feb 11 02:05:58 2026 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 8024A25A34D for ; Mon, 10 Feb 2025 19:38:39 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=170.10.129.124 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1739216321; cv=none; b=PXQtRdfvDL6V8flmj06WZTC4oBUTZAeTKLyPIEiMh4qJHLCYy5o/zruy78eieYxyHj3VTZ/gLEJ/sTdw8dByyYpBWSMufFqyTHGD8likyyeO3G4fD0Mp8ghvuXpuWFuFsEe7syp4chFaMaRtbNz1MfgIUEb4hIRiihnK4ycc7MU= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1739216321; c=relaxed/simple; bh=76d5pEBVKf4Yijczbwj7KuBibjsq0aAlESoqsTf2wbw=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=PLENwHrzThErYz5aNEiqVtM8Ip/CM/ASMLL5tMMLzuiOE9T7zbGevgzYMtkmMBgVRo9P5DHQ0T5CbH68duXl5+yvFytDkErRCx6wjSSbMhkhSrUYUThLOS9UzSJ/PVTmy9kpeV+I/sobGPmyoZD/bBcdwVcLgsjUnYKyEXRy3UA= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=redhat.com; spf=pass smtp.mailfrom=redhat.com; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b=es0bOGsN; arc=none smtp.client-ip=170.10.129.124 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=redhat.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=redhat.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b="es0bOGsN" DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1739216318; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=8sC+CKPivo9nJ2nwAVN+E4SAgEKJPEiCVQrD/mzwcqM=; b=es0bOGsN+emNWNv0iKtdB+/awn1NRoHB7YrfChy+rLlYA5DTzw/IxgRvietxiTkm8QeMQJ PAAYFe9CnfSBFi2IOFCsoMSOVm2xD9R1eH2egtajc9Z9ybOlUE+b/drXUagETlqCbzaihq 1k1rfy1IcAwFoqUrQWAAeKi5VLJGMC4= Received: from mail-wm1-f71.google.com (mail-wm1-f71.google.com [209.85.128.71]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-339-Xjc4f11yOXe_gRn-d1PJUA-1; Mon, 10 Feb 2025 14:38:37 -0500 X-MC-Unique: Xjc4f11yOXe_gRn-d1PJUA-1 X-Mimecast-MFC-AGG-ID: Xjc4f11yOXe_gRn-d1PJUA Received: by mail-wm1-f71.google.com with SMTP id 5b1f17b1804b1-4392fc6bceaso13133485e9.2 for ; Mon, 10 Feb 2025 11:38:36 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1739216316; x=1739821116; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=8sC+CKPivo9nJ2nwAVN+E4SAgEKJPEiCVQrD/mzwcqM=; b=vxFtFf+OHNqNG1k2lYO0yHmCR8JfwuLtCMJ7MB8byh3DzmWMOXLoOO5T7CcbJhmSSD 50ZY850iYdIP3q21oBm5TV1j8zqJmGeU680ZuzdiwsyFlmCPjTAEiaX0nCj4EckyF4bm 1ERdxSNLGbFeUQYbOFQ1zl8QS1qZLG6LsxM7wig0KDUpPoeX/eHBOm5c2L1nJG4v0bcr jUBKhE6gkNIhmaVnZAK29XOwq65L0D2qWXQw/gaGlrkb5YQS4XMqlAzvPq3jDfX3tvrL zgyC50LgR6c3T1PXXTZnfpSk7KrqjPnruCTus1VutXgY+v2ppQbl9EDXIysh0/1UwkHG iDqw== X-Gm-Message-State: AOJu0YzPzphyIRaxPag6sdKuNyw+nKyJhdPEE4Gk/3PZ3ZlmfhhGRPOz EqEazLwgFZygGJ1ryMVlekk2HfASp4iCXWG7NwbdgO6i7ZkXrdwsjXJbVZRTrVm/gDIJd5erhpa WNCaWf/qOI0XQbcAWXgdV18nkPi2gcyf9zEJzDYnwIIOnHSQeYl1S6dN3CaGAuDVuuFY/Uj7bA5 qi5IIQLT31wwCVJdbMw6qNr6qUkNPstud3Ht6gX1SeFmQE X-Gm-Gg: ASbGnctPsfNrOxuQHUhUl4Mx8AE6cCPOK/PuItCheqjfctTEpvS6p/ElygWNMkUBwyM Da0TdL4Rp1lxEZgCOn2RQrGI3ufwGTy2Hb4jQVj+dNGxmKnBwbKliklf44zasWsqZhW049eTsxM ftkB3lC6ckcnhsrR0B8vnpjEqJwgm/H45/ed/4rCnkzRcARHolKJmQk/BbFRpZ0yjcwQRUxYc2l 0U8h6+vZ/dBar866OgbrYn3zEVa4ftvebgBIbKoitLWLiE8hw2dozk0KpN75qPJXRatPKBv//ST J7cpZ8uaCuhqrV/CjORXUtPtj54/pKfeC1lAB556aMS+NleMPGx/p9q4SS/nRQdKYQ== X-Received: by 2002:a05:600c:4e91:b0:439:4637:9d9 with SMTP id 5b1f17b1804b1-43946370d97mr43287695e9.12.1739216315807; Mon, 10 Feb 2025 11:38:35 -0800 (PST) X-Google-Smtp-Source: AGHT+IF6B7q2dhfsb9dj8LY+8M1CJU6DFUQTZK7m2VERaPDmly8AB+qIiZRItOcPeyw5uy4LKosd2g== X-Received: by 2002:a05:600c:4e91:b0:439:4637:9d9 with SMTP id 5b1f17b1804b1-43946370d97mr43287075e9.12.1739216315147; Mon, 10 Feb 2025 11:38:35 -0800 (PST) Received: from localhost (p200300cbc734b80012c465cd348aaee6.dip0.t-ipconnect.de. [2003:cb:c734:b800:12c4:65cd:348a:aee6]) by smtp.gmail.com with UTF8SMTPSA id 5b1f17b1804b1-4390d94d802sm195260345e9.12.2025.02.10.11.38.31 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 10 Feb 2025 11:38:33 -0800 (PST) From: David Hildenbrand To: linux-kernel@vger.kernel.org Cc: linux-doc@vger.kernel.org, dri-devel@lists.freedesktop.org, linux-mm@kvack.org, nouveau@lists.freedesktop.org, linux-trace-kernel@vger.kernel.org, linux-perf-users@vger.kernel.org, damon@lists.linux.dev, David Hildenbrand , Andrew Morton , =?UTF-8?q?J=C3=A9r=C3=B4me=20Glisse?= , Jonathan Corbet , Alex Shi , Yanteng Si , Karol Herbst , Lyude Paul , Danilo Krummrich , David Airlie , Simona Vetter , Masami Hiramatsu , Oleg Nesterov , Peter Zijlstra , SeongJae Park , "Liam R. Howlett" , Lorenzo Stoakes , Vlastimil Babka , Jann Horn , Pasha Tatashin , Peter Xu , Alistair Popple , Jason Gunthorpe Subject: [PATCH v2 08/17] kernel/events/uprobes: handle device-exclusive entries correctly in __replace_page() Date: Mon, 10 Feb 2025 20:37:50 +0100 Message-ID: <20250210193801.781278-9-david@redhat.com> X-Mailer: git-send-email 2.48.1 In-Reply-To: <20250210193801.781278-1-david@redhat.com> References: <20250210193801.781278-1-david@redhat.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" Ever since commit b756a3b5e7ea ("mm: device exclusive memory access") we can return with a device-exclusive entry from page_vma_mapped_walk(). __replace_page() is not prepared for that, so teach it about these PFN swap PTEs. Note that device-private entries are so far not applicable on that path, because GUP would never have returned such folios (conversion to device-private happens by page migration, not in-place conversion of the PTE). There is a race between GUP and us locking the folio to look it up using page_vma_mapped_walk(), so this is likely a fix (unless something else could prevent that race, but it doesn't look like). pte_pfn() on something that is not a present pte could give use garbage, and we'd wrongly mess up the mapcount because it was already adjusted by calling folio_remove_rmap_pte() when making the entry device-exclusive. Fixes: b756a3b5e7ea ("mm: device exclusive memory access") Signed-off-by: David Hildenbrand --- kernel/events/uprobes.c | 13 ++++++++++++- 1 file changed, 12 insertions(+), 1 deletion(-) diff --git a/kernel/events/uprobes.c b/kernel/events/uprobes.c index 2ca797cbe465f..cd6105b100325 100644 --- a/kernel/events/uprobes.c +++ b/kernel/events/uprobes.c @@ -173,6 +173,7 @@ static int __replace_page(struct vm_area_struct *vma, u= nsigned long addr, DEFINE_FOLIO_VMA_WALK(pvmw, old_folio, vma, addr, 0); int err; struct mmu_notifier_range range; + pte_t pte; =20 mmu_notifier_range_init(&range, MMU_NOTIFY_CLEAR, 0, mm, addr, addr + PAGE_SIZE); @@ -192,6 +193,16 @@ static int __replace_page(struct vm_area_struct *vma, = unsigned long addr, if (!page_vma_mapped_walk(&pvmw)) goto unlock; VM_BUG_ON_PAGE(addr !=3D pvmw.address, old_page); + pte =3D ptep_get(pvmw.pte); + + /* + * Handle PFN swap PTES, such as device-exclusive ones, that actually + * map pages: simply trigger GUP again to fix it up. + */ + if (unlikely(!pte_present(pte))) { + page_vma_mapped_walk_done(&pvmw); + goto unlock; + } =20 if (new_page) { folio_get(new_folio); @@ -206,7 +217,7 @@ static int __replace_page(struct vm_area_struct *vma, u= nsigned long addr, inc_mm_counter(mm, MM_ANONPAGES); } =20 - flush_cache_page(vma, addr, pte_pfn(ptep_get(pvmw.pte))); + flush_cache_page(vma, addr, pte_pfn(pte)); ptep_clear_flush(vma, addr, pvmw.pte); if (new_page) set_pte_at(mm, addr, pvmw.pte, --=20 2.48.1