From nobody Thu Nov 14 16:49:30 2024 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 9060EC6FA82 for ; Tue, 27 Sep 2022 17:05:45 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232345AbiI0RFm (ORCPT ); Tue, 27 Sep 2022 13:05:42 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59460 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232210AbiI0RFc (ORCPT ); Tue, 27 Sep 2022 13:05:32 -0400 Received: from NAM11-CO1-obe.outbound.protection.outlook.com (mail-co1nam11on2076.outbound.protection.outlook.com [40.107.220.76]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6AD4B57E3A for ; Tue, 27 Sep 2022 10:05:17 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=nSdkxCFmTxWURgBDGQJilT8ynGgz4G1fFXjxsIpUJnlOMUNOSBTx+IFr5XUgVIfypu4xXe82gIXp6GD4X8PpJBFcelQSOr/n07GoIDuK4VHwGXplNUNZMzpsQfK+lBqFKtBtipBRzVLminOdsHbKXk8HF7dOqQlb9KJ41ZQhqdwGXHofv+t7/dH0HpQJgRYJq3/1fsgzaIBS+B0cFTNcRLnVLnGIORv80rPYYjxg4RLh3qGCUql8OYqqHj69nAGtBA9lpXEWFJlsQ+csUG+2Jc4elGO4CaBytO5sa/otRsNP0To+AuitSuKt+RXR5ZTQ6O7DfHXv4guG/fk2+btGSQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=JKUG239Q84D0CNl9h8UMesh0sB8LZBUU9z+i77Al2Eo=; b=Ym9ZmxPXXmwNvBbGFlqpCmE1+TSIwevHfTO1QYBICvEZd34zduGPUeLenVGOymy9qd4PPwMDm0e+Aelkio6J0bQq2V9ru4/TtsMIPaXtZmpc1TDWEUAnDUHp6eBogVUtb6RzW1w6O/8JYNZD+r3o5Ccq3Dk0/0xR5hERknqXmyfblEiHj+PZF1XYD49C/S+yh1slrlSlrVAbBgFELnQ359TovJcEYakYzg71xE20K/jOj1Nz3m1q46ta2Z06TG9WwTQ6lyGBnKA9o7L5wV2ketlmj7XNKSjskBFzMzrxRtSLUYd2WoiF75Kgn4H5lb32p6oJFWoz6HcgsJFylYNp+g== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 165.204.84.17) smtp.rcpttodomain=vger.kernel.org smtp.mailfrom=amd.com; dmarc=pass (p=quarantine sp=quarantine pct=100) action=none header.from=amd.com; dkim=none (message not signed); arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amd.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=JKUG239Q84D0CNl9h8UMesh0sB8LZBUU9z+i77Al2Eo=; b=JctAJhvHGs+JIt0G0mRV2FPjXAfg3fSjs8Pna6edYcFgg0w+EVNjnvhHqpKDLlzMy2IJmEVLuKOls2cW/torF2q9nltx7HtepZ+If/ay3yCMk9f5+Vtr21xyNcImzEE79IOazc5OhlqzIqNUhj5EFvw8tqKZc0yD8rnCO0mycvY= Received: from BN1PR14CA0008.namprd14.prod.outlook.com (2603:10b6:408:e3::13) by CY5PR12MB6646.namprd12.prod.outlook.com (2603:10b6:930:41::14) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.5676.17; Tue, 27 Sep 2022 17:05:14 +0000 Received: from BN8NAM11FT039.eop-nam11.prod.protection.outlook.com (2603:10b6:408:e3:cafe::1e) by BN1PR14CA0008.outlook.office365.com (2603:10b6:408:e3::13) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.5676.17 via Frontend Transport; Tue, 27 Sep 2022 17:05:14 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 165.204.84.17) smtp.mailfrom=amd.com; dkim=none (message not signed) header.d=none;dmarc=pass action=none header.from=amd.com; Received-SPF: Pass (protection.outlook.com: domain of amd.com designates 165.204.84.17 as permitted sender) receiver=protection.outlook.com; client-ip=165.204.84.17; helo=SATLEXMB04.amd.com; pr=C Received: from SATLEXMB04.amd.com (165.204.84.17) by BN8NAM11FT039.mail.protection.outlook.com (10.13.177.169) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.20.5654.14 via Frontend Transport; Tue, 27 Sep 2022 17:05:14 +0000 Received: from tlendack-t1.amd.com (10.180.168.240) by SATLEXMB04.amd.com (10.181.40.145) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.28; Tue, 27 Sep 2022 12:05:12 -0500 From: Tom Lendacky To: , CC: Thomas Gleixner , Ingo Molnar , Borislav Petkov , Dave Hansen , "Kirill A. Shutemov" , "H. Peter Anvin" , Michael Roth , Joerg Roedel , Andy Lutomirski , Peter Zijlstra Subject: [PATCH v5 5/6] x86/sev: Use large PSC requests if applicable Date: Tue, 27 Sep 2022 12:04:20 -0500 Message-ID: <632a4d3c7fa2f30d2d0d1c442b18d556f85c3449.1664298261.git.thomas.lendacky@amd.com> X-Mailer: git-send-email 2.37.3 In-Reply-To: References: <20220614120231.48165-1-kirill.shutemov@linux.intel.com> MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable X-Originating-IP: [10.180.168.240] X-ClientProxiedBy: SATLEXMB03.amd.com (10.181.40.144) To SATLEXMB04.amd.com (10.181.40.145) X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: BN8NAM11FT039:EE_|CY5PR12MB6646:EE_ X-MS-Office365-Filtering-Correlation-Id: 08f97a97-6a1c-4521-38d9-08daa0aa71dc X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: 1hxa/O0KDh/vhs8rn8eipvHyGJ/LeskVpCXugb45qm/LnMxwGCleKDnXsc4d8CODsijVwD9iYh6BELtwtl7ozIm5oS16Y+lNyp2fURjTtXu804zcrIywi5oLZp0MsfuKnc1Ddc5ehLfASZSEK1adl9utD83VMmOyFNYG9q4id9dsat0jRwfGVizFW7Vio0/UqTNVisqH1d+svISI4j7AqS87CRXPCdKc5oZNj/Lu8Z6RAea0dLzCT+1j0S+2SrZjxgrhfN9mDA+vsnkOPUkYs1EiDmOO1KUkS3Gxhft7ue76Dlby6EyGzJu+RmQH8J5qQhABfrmuQG767GdiibndT407Q9AgYbxaMYKTMj4koT2veBz8eYsXWy4kT2rXMb1gWPNbqUxY2WwkH2nlUDHx6FK/+f2esh4NfVXlbq7ts+mPprcj000Fo+TM+zT93TVdB5O5j1ZIKEfrDUpJjxnyh1Zkc5eTUHRlAxy/fz1tf4LJAvLpWUgyFk81jGLWn+CLNtUQ5kJcObFTQKiTX2QiMQrukoPDQwaxPqxY1h40uPqXQcr0EFxG8T8b7JTMAmQyyEGw051skfaeAFVrfHb3nFEqX0jUEK25RwEeApeX1QhhN4gEKAToxc21A4yW7GUgzSmI9dlTaGzvBbP/GdZdtopJQUDv69B2uXBP1Q+px1G43TIxnrDi+bi1rB0zGzC/Z/I1JU48r99JSLq/IIOTUe2ZTO1XwSA5XdcHekW0lDxE2yzggYf60X3v/pQB8a3GRp6vgTjd+fN72hV91ZrZh75yePk9SXC0Ai994VZ2XYA= X-Forefront-Antispam-Report: CIP:165.204.84.17;CTRY:US;LANG:en;SCL:1;SRV:;IPV:CAL;SFV:NSPM;H:SATLEXMB04.amd.com;PTR:InfoDomainNonexistent;CAT:NONE;SFS:(13230022)(4636009)(346002)(396003)(39860400002)(136003)(376002)(451199015)(40470700004)(36840700001)(46966006)(82310400005)(40460700003)(478600001)(7416002)(6666004)(356005)(5660300002)(2616005)(36860700001)(26005)(86362001)(8936002)(316002)(336012)(36756003)(7696005)(16526019)(110136005)(186003)(54906003)(82740400003)(70586007)(4326008)(70206006)(81166007)(40480700001)(41300700001)(2906002)(426003)(8676002)(47076005)(83380400001)(36900700001);DIR:OUT;SFP:1101; X-OriginatorOrg: amd.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 27 Sep 2022 17:05:14.0707 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 08f97a97-6a1c-4521-38d9-08daa0aa71dc X-MS-Exchange-CrossTenant-Id: 3dd8961f-e488-4e60-8e11-a82d994e183d X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=3dd8961f-e488-4e60-8e11-a82d994e183d;Ip=[165.204.84.17];Helo=[SATLEXMB04.amd.com] X-MS-Exchange-CrossTenant-AuthSource: BN8NAM11FT039.eop-nam11.prod.protection.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: CY5PR12MB6646 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Type: text/plain; charset="utf-8" In advance of providing support for unaccepted memory, request 2M Page State Change (PSC) requests when the address range allows for it. By using a 2M page size, more PSC operations can be handled in a single request to the hypervisor. The hypervisor will determine if it can accommodate the larger request by checking the mapping in the nested page table. If mapped as a large page, then the 2M page request can be performed, otherwise the 2M page request will be broken down into 512 4K page requests. This is still more efficient than having the guest perform multiple PSC requests in order to process the 512 4K pages. In conjunction with the 2M PSC requests, attempt to perform the associated PVALIDATE instruction of the page using the 2M page size. If PVALIDATE fails with a size mismatch, then fallback to validating 512 4K pages. To do this, page validation is modified to work with the PSC structure and not just a virtual address range. Signed-off-by: Tom Lendacky --- arch/x86/include/asm/sev.h | 4 ++ arch/x86/kernel/sev.c | 125 ++++++++++++++++++++++++------------- 2 files changed, 84 insertions(+), 45 deletions(-) diff --git a/arch/x86/include/asm/sev.h b/arch/x86/include/asm/sev.h index 19514524f0f8..0007ab04ac5f 100644 --- a/arch/x86/include/asm/sev.h +++ b/arch/x86/include/asm/sev.h @@ -79,11 +79,15 @@ extern void vc_no_ghcb(void); extern void vc_boot_ghcb(void); extern bool handle_vc_boot_ghcb(struct pt_regs *regs); =20 +/* PVALIDATE return codes */ +#define PVALIDATE_FAIL_SIZEMISMATCH 6 + /* Software defined (when rFlags.CF =3D 1) */ #define PVALIDATE_FAIL_NOUPDATE 255 =20 /* RMP page size */ #define RMP_PG_SIZE_4K 0 +#define RMP_PG_SIZE_2M 1 =20 #define RMPADJUST_VMSA_PAGE_BIT BIT(16) =20 diff --git a/arch/x86/kernel/sev.c b/arch/x86/kernel/sev.c index 0b958d77abb4..eabb8dd5be5b 100644 --- a/arch/x86/kernel/sev.c +++ b/arch/x86/kernel/sev.c @@ -655,32 +655,58 @@ static u64 __init get_jump_table_addr(void) return ret; } =20 -static void pvalidate_pages(unsigned long vaddr, unsigned int npages, bool= validate) +static void pvalidate_pages(struct snp_psc_desc *desc) { - unsigned long vaddr_end; + struct psc_entry *e; + unsigned long vaddr; + unsigned int size; + unsigned int i; + bool validate; int rc; =20 - vaddr =3D vaddr & PAGE_MASK; - vaddr_end =3D vaddr + ((unsigned long)npages << PAGE_SHIFT); + for (i =3D 0; i <=3D desc->hdr.end_entry; i++) { + e =3D &desc->entries[i]; + + vaddr =3D (unsigned long)pfn_to_kaddr(e->gfn); + size =3D e->pagesize ? RMP_PG_SIZE_2M : RMP_PG_SIZE_4K; + validate =3D (e->operation =3D=3D SNP_PAGE_STATE_PRIVATE) ? true : false; + + rc =3D pvalidate(vaddr, size, validate); + if (rc =3D=3D PVALIDATE_FAIL_SIZEMISMATCH && size =3D=3D RMP_PG_SIZE_2M)= { + unsigned long vaddr_end =3D vaddr + PMD_PAGE_SIZE; + + for (; vaddr < vaddr_end; vaddr +=3D PAGE_SIZE) { + rc =3D pvalidate(vaddr, RMP_PG_SIZE_4K, validate); + if (rc) + break; + } + } =20 - while (vaddr < vaddr_end) { - rc =3D pvalidate(vaddr, RMP_PG_SIZE_4K, validate); if (WARN(rc, "Failed to validate address 0x%lx ret %d", vaddr, rc)) sev_es_terminate(SEV_TERM_SET_LINUX, GHCB_TERM_PVALIDATE); - - vaddr =3D vaddr + PAGE_SIZE; } } =20 -static void early_set_pages_state(unsigned long paddr, unsigned int npages= , enum psc_op op) +static void early_set_pages_state(unsigned long vaddr, unsigned long paddr, + unsigned int npages, enum psc_op op) { unsigned long paddr_end; u64 val; + int ret; + + vaddr =3D vaddr & PAGE_MASK; =20 paddr =3D paddr & PAGE_MASK; paddr_end =3D paddr + ((unsigned long)npages << PAGE_SHIFT); =20 while (paddr < paddr_end) { + if (op =3D=3D SNP_PAGE_STATE_SHARED) { + /* Page validation must be rescinded before changing to shared */ + ret =3D pvalidate(vaddr, RMP_PG_SIZE_4K, false); + if (WARN(ret, "Failed to validate address 0x%lx ret %d", paddr, ret)) + goto e_term; + } + /* * Use the MSR protocol because this function can be called before * the GHCB is established. @@ -701,7 +727,15 @@ static void early_set_pages_state(unsigned long paddr,= unsigned int npages, enum paddr, GHCB_MSR_PSC_RESP_VAL(val))) goto e_term; =20 - paddr =3D paddr + PAGE_SIZE; + if (op =3D=3D SNP_PAGE_STATE_PRIVATE) { + /* Page validation must be performed after changing to private */ + ret =3D pvalidate(vaddr, RMP_PG_SIZE_4K, true); + if (WARN(ret, "Failed to validate address 0x%lx ret %d", paddr, ret)) + goto e_term; + } + + vaddr +=3D PAGE_SIZE; + paddr +=3D PAGE_SIZE; } =20 return; @@ -720,10 +754,7 @@ void __init early_snp_set_memory_private(unsigned long= vaddr, unsigned long padd * Ask the hypervisor to mark the memory pages as private in the RMP * table. */ - early_set_pages_state(paddr, npages, SNP_PAGE_STATE_PRIVATE); - - /* Validate the memory pages after they've been added in the RMP table. */ - pvalidate_pages(vaddr, npages, true); + early_set_pages_state(vaddr, paddr, npages, SNP_PAGE_STATE_PRIVATE); } =20 void __init early_snp_set_memory_shared(unsigned long vaddr, unsigned long= paddr, @@ -732,11 +763,8 @@ void __init early_snp_set_memory_shared(unsigned long = vaddr, unsigned long paddr if (!cc_platform_has(CC_ATTR_GUEST_SEV_SNP)) return; =20 - /* Invalidate the memory pages before they are marked shared in the RMP t= able. */ - pvalidate_pages(vaddr, npages, false); - /* Ask hypervisor to mark the memory pages shared in the RMP table. */ - early_set_pages_state(paddr, npages, SNP_PAGE_STATE_SHARED); + early_set_pages_state(vaddr, paddr, npages, SNP_PAGE_STATE_SHARED); } =20 void __init snp_prep_memory(unsigned long paddr, unsigned int sz, enum psc= _op op) @@ -820,10 +848,11 @@ static int vmgexit_psc(struct ghcb *ghcb, struct snp_= psc_desc *desc) return ret; } =20 -static void __set_pages_state(struct snp_psc_desc *data, unsigned long vad= dr, - unsigned long vaddr_end, int op) +static unsigned long __set_pages_state(struct snp_psc_desc *data, unsigned= long vaddr, + unsigned long vaddr_end, int op) { struct ghcb_state state; + bool use_large_entry; struct psc_hdr *hdr; struct psc_entry *e; unsigned long flags; @@ -837,27 +866,37 @@ static void __set_pages_state(struct snp_psc_desc *da= ta, unsigned long vaddr, memset(data, 0, sizeof(*data)); i =3D 0; =20 - while (vaddr < vaddr_end) { - if (is_vmalloc_addr((void *)vaddr)) + while (vaddr < vaddr_end && i < ARRAY_SIZE(data->entries)) { + hdr->end_entry =3D i; + + if (is_vmalloc_addr((void *)vaddr)) { pfn =3D vmalloc_to_pfn((void *)vaddr); - else + use_large_entry =3D false; + } else { pfn =3D __pa(vaddr) >> PAGE_SHIFT; + use_large_entry =3D true; + } =20 e->gfn =3D pfn; e->operation =3D op; - hdr->end_entry =3D i; =20 - /* - * Current SNP implementation doesn't keep track of the RMP page - * size so use 4K for simplicity. - */ - e->pagesize =3D RMP_PG_SIZE_4K; + if (use_large_entry && IS_ALIGNED(vaddr, PMD_PAGE_SIZE) && + (vaddr_end - vaddr) >=3D PMD_PAGE_SIZE) { + e->pagesize =3D RMP_PG_SIZE_2M; + vaddr +=3D PMD_PAGE_SIZE; + } else { + e->pagesize =3D RMP_PG_SIZE_4K; + vaddr +=3D PAGE_SIZE; + } =20 - vaddr =3D vaddr + PAGE_SIZE; e++; i++; } =20 + /* Page validation must be rescinded before changing to shared */ + if (op =3D=3D SNP_PAGE_STATE_SHARED) + pvalidate_pages(data); + local_irq_save(flags); =20 if (sev_cfg.ghcbs_initialized) @@ -865,6 +904,7 @@ static void __set_pages_state(struct snp_psc_desc *data= , unsigned long vaddr, else ghcb =3D boot_ghcb; =20 + /* Invoke the hypervisor to perform the page state changes */ if (!ghcb || vmgexit_psc(ghcb, data)) sev_es_terminate(SEV_TERM_SET_LINUX, GHCB_TERM_PSC); =20 @@ -872,29 +912,28 @@ static void __set_pages_state(struct snp_psc_desc *da= ta, unsigned long vaddr, __sev_put_ghcb(&state); =20 local_irq_restore(flags); + + /* Page validation must be performed after changing to private */ + if (op =3D=3D SNP_PAGE_STATE_PRIVATE) + pvalidate_pages(data); + + return vaddr; } =20 static void set_pages_state(unsigned long vaddr, unsigned int npages, int = op) { - unsigned long vaddr_end, next_vaddr; struct snp_psc_desc desc; + unsigned long vaddr_end; =20 /* Use the MSR protocol when a GHCB is not available. */ if (!boot_ghcb) - return early_set_pages_state(__pa(vaddr), npages, op); + return early_set_pages_state(vaddr, __pa(vaddr), npages, op); =20 vaddr =3D vaddr & PAGE_MASK; vaddr_end =3D vaddr + ((unsigned long)npages << PAGE_SHIFT); =20 - while (vaddr < vaddr_end) { - /* Calculate the last vaddr that fits in one struct snp_psc_desc. */ - next_vaddr =3D min_t(unsigned long, vaddr_end, - (VMGEXIT_PSC_MAX_ENTRY * PAGE_SIZE) + vaddr); - - __set_pages_state(&desc, vaddr, next_vaddr, op); - - vaddr =3D next_vaddr; - } + while (vaddr < vaddr_end) + vaddr =3D __set_pages_state(&desc, vaddr, vaddr_end, op); } =20 void snp_set_memory_shared(unsigned long vaddr, unsigned int npages) @@ -902,8 +941,6 @@ void snp_set_memory_shared(unsigned long vaddr, unsigne= d int npages) if (!cc_platform_has(CC_ATTR_GUEST_SEV_SNP)) return; =20 - pvalidate_pages(vaddr, npages, false); - set_pages_state(vaddr, npages, SNP_PAGE_STATE_SHARED); } =20 @@ -913,8 +950,6 @@ void snp_set_memory_private(unsigned long vaddr, unsign= ed int npages) return; =20 set_pages_state(vaddr, npages, SNP_PAGE_STATE_PRIVATE); - - pvalidate_pages(vaddr, npages, true); } =20 static int snp_set_vmsa(void *va, bool vmsa) --=20 2.37.3