include/sysemu/kvm.h | 1 + accel/kvm/kvm-all.c | 2 +- target/arm/kvm.c | 2 ++ target/i386/kvm/kvm.c | 2 ++ target/mips/kvm.c | 2 ++ target/ppc/kvm.c | 2 ++ target/riscv/kvm.c | 2 ++ target/s390x/kvm/kvm.c | 2 ++ 8 files changed, 14 insertions(+), 1 deletion(-)
libvirt uses "none" machine type to test KVM availability. Before this
change, QEMU used to pass 0 as machine type when calling KVM_CREATE_VM.
The kernel documentation says:
> On arm64, the physical address size for a VM (IPA Size limit) is
> limited to 40bits by default. The limit can be configured if the host
> supports the extension KVM_CAP_ARM_VM_IPA_SIZE. When supported, use
> KVM_VM_TYPE_ARM_IPA_SIZE(IPA_Bits) to set the size in the machine type
> identifier, where IPA_Bits is the maximum width of any physical
> address used by the VM. The IPA_Bits is encoded in bits[7-0] of the
> machine type identifier.
>
> e.g, to configure a guest to use 48bit physical address size::
>
> vm_fd = ioctl(dev_fd, KVM_CREATE_VM, KVM_VM_TYPE_ARM_IPA_SIZE(48));
>
> The requested size (IPA_Bits) must be:
>
> == =========================================================
> 0 Implies default size, 40bits (for backward compatibility)
> N Implies N bits, where N is a positive integer such that,
> 32 <= N <= Host_IPA_Limit
> == =========================================================
> Host_IPA_Limit is the maximum possible value for IPA_Bits on the host
> and is dependent on the CPU capability and the kernel configuration.
> The limit can be retrieved using KVM_CAP_ARM_VM_IPA_SIZE of the
> KVM_CHECK_EXTENSION ioctl() at run-time.
>
> Creation of the VM will fail if the requested IPA size (whether it is
> implicit or explicit) is unsupported on the host.
https://docs.kernel.org/virt/kvm/api.html#kvm-create-vm
So if Host_IPA_Limit < 40, such KVM_CREATE_VM will fail, and libvirt
incorrectly thinks KVM is not available. This actually happened on M2
MacBook Air.
Fix this by specifying 32 for IPA_Bits as any arm64 system should
support the value according to the documentation.
Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
---
V1 -> V2: Introduced an arch hook
include/sysemu/kvm.h | 1 +
accel/kvm/kvm-all.c | 2 +-
target/arm/kvm.c | 2 ++
target/i386/kvm/kvm.c | 2 ++
target/mips/kvm.c | 2 ++
target/ppc/kvm.c | 2 ++
target/riscv/kvm.c | 2 ++
target/s390x/kvm/kvm.c | 2 ++
8 files changed, 14 insertions(+), 1 deletion(-)
diff --git a/include/sysemu/kvm.h b/include/sysemu/kvm.h
index 115f0cca79..7cc2eb1a8c 100644
--- a/include/sysemu/kvm.h
+++ b/include/sysemu/kvm.h
@@ -347,6 +347,7 @@ bool kvm_device_supported(int vmfd, uint64_t type);
/* Arch specific hooks */
+extern const int kvm_arch_default_type;
extern const KVMCapabilityInfo kvm_arch_required_capabilities[];
void kvm_arch_accel_class_init(ObjectClass *oc);
diff --git a/accel/kvm/kvm-all.c b/accel/kvm/kvm-all.c
index 373d876c05..914ade3ec3 100644
--- a/accel/kvm/kvm-all.c
+++ b/accel/kvm/kvm-all.c
@@ -2458,7 +2458,7 @@ static int kvm_init(MachineState *ms)
KVMState *s;
const KVMCapabilityInfo *missing_cap;
int ret;
- int type = 0;
+ int type = kvm_arch_default_type;
uint64_t dirty_log_manual_caps;
qemu_mutex_init(&kml_slots_lock);
diff --git a/target/arm/kvm.c b/target/arm/kvm.c
index b4c7654f49..f8203b9915 100644
--- a/target/arm/kvm.c
+++ b/target/arm/kvm.c
@@ -32,6 +32,8 @@
#include "hw/irq.h"
#include "qemu/log.h"
+const int kvm_arch_default_type = 32;
+
const KVMCapabilityInfo kvm_arch_required_capabilities[] = {
KVM_CAP_LAST_INFO
};
diff --git a/target/i386/kvm/kvm.c b/target/i386/kvm/kvm.c
index ebfaf3d24c..00bccffdfc 100644
--- a/target/i386/kvm/kvm.c
+++ b/target/i386/kvm/kvm.c
@@ -87,6 +87,8 @@
static void kvm_init_msrs(X86CPU *cpu);
+const int kvm_arch_default_type;
+
const KVMCapabilityInfo kvm_arch_required_capabilities[] = {
KVM_CAP_INFO(SET_TSS_ADDR),
KVM_CAP_INFO(EXT_CPUID),
diff --git a/target/mips/kvm.c b/target/mips/kvm.c
index c14e8f550f..754366874e 100644
--- a/target/mips/kvm.c
+++ b/target/mips/kvm.c
@@ -33,6 +33,8 @@
static int kvm_mips_fpu_cap;
static int kvm_mips_msa_cap;
+const int kvm_arch_default_type;
+
const KVMCapabilityInfo kvm_arch_required_capabilities[] = {
KVM_CAP_LAST_INFO
};
diff --git a/target/ppc/kvm.c b/target/ppc/kvm.c
index a8a935e267..86f8b645a5 100644
--- a/target/ppc/kvm.c
+++ b/target/ppc/kvm.c
@@ -54,6 +54,8 @@
#define DEBUG_RETURN_GUEST 0
#define DEBUG_RETURN_GDB 1
+const int kvm_arch_default_type;
+
const KVMCapabilityInfo kvm_arch_required_capabilities[] = {
KVM_CAP_LAST_INFO
};
diff --git a/target/riscv/kvm.c b/target/riscv/kvm.c
index 9d8a8982f9..fbe56ec3bb 100644
--- a/target/riscv/kvm.c
+++ b/target/riscv/kvm.c
@@ -764,6 +764,8 @@ void kvm_riscv_init_user_properties(Object *cpu_obj)
kvm_riscv_destroy_scratch_vcpu(&kvmcpu);
}
+const int kvm_arch_default_type;
+
const KVMCapabilityInfo kvm_arch_required_capabilities[] = {
KVM_CAP_LAST_INFO
};
diff --git a/target/s390x/kvm/kvm.c b/target/s390x/kvm/kvm.c
index a9e5880349..4b9347ffc8 100644
--- a/target/s390x/kvm/kvm.c
+++ b/target/s390x/kvm/kvm.c
@@ -144,6 +144,8 @@ static CPUWatchpoint hw_watchpoint;
static struct kvm_hw_breakpoint *hw_breakpoints;
static int nb_hw_breakpoints;
+const int kvm_arch_default_type;
+
const KVMCapabilityInfo kvm_arch_required_capabilities[] = {
KVM_CAP_LAST_INFO
};
--
2.41.0
Hi Akihiko, On 21/7/23 08:24, Akihiko Odaki wrote: > libvirt uses "none" machine type to test KVM availability. Before this > change, QEMU used to pass 0 as machine type when calling KVM_CREATE_VM. > > The kernel documentation says: >> On arm64, the physical address size for a VM (IPA Size limit) is >> limited to 40bits by default. The limit can be configured if the host >> supports the extension KVM_CAP_ARM_VM_IPA_SIZE. When supported, use >> KVM_VM_TYPE_ARM_IPA_SIZE(IPA_Bits) to set the size in the machine type >> identifier, where IPA_Bits is the maximum width of any physical >> address used by the VM. The IPA_Bits is encoded in bits[7-0] of the >> machine type identifier. >> >> e.g, to configure a guest to use 48bit physical address size:: >> >> vm_fd = ioctl(dev_fd, KVM_CREATE_VM, KVM_VM_TYPE_ARM_IPA_SIZE(48)); >> >> The requested size (IPA_Bits) must be: >> >> == ========================================================= >> 0 Implies default size, 40bits (for backward compatibility) >> N Implies N bits, where N is a positive integer such that, >> 32 <= N <= Host_IPA_Limit >> == ========================================================= > >> Host_IPA_Limit is the maximum possible value for IPA_Bits on the host >> and is dependent on the CPU capability and the kernel configuration. >> The limit can be retrieved using KVM_CAP_ARM_VM_IPA_SIZE of the >> KVM_CHECK_EXTENSION ioctl() at run-time. >> >> Creation of the VM will fail if the requested IPA size (whether it is >> implicit or explicit) is unsupported on the host. > https://docs.kernel.org/virt/kvm/api.html#kvm-create-vm > > So if Host_IPA_Limit < 40, such KVM_CREATE_VM will fail, and libvirt > incorrectly thinks KVM is not available. This actually happened on M2 > MacBook Air. > > Fix this by specifying 32 for IPA_Bits as any arm64 system should > support the value according to the documentation. > > Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com> > --- > V1 -> V2: Introduced an arch hook > > include/sysemu/kvm.h | 1 + > accel/kvm/kvm-all.c | 2 +- > target/arm/kvm.c | 2 ++ > target/i386/kvm/kvm.c | 2 ++ > target/mips/kvm.c | 2 ++ > target/ppc/kvm.c | 2 ++ > target/riscv/kvm.c | 2 ++ > target/s390x/kvm/kvm.c | 2 ++ > 8 files changed, 14 insertions(+), 1 deletion(-) My understanding of Peter's suggestion would be smth like: -- >8 -- diff --git a/include/sysemu/kvm.h b/include/sysemu/kvm.h index 115f0cca79..c0af15eb6c 100644 --- a/include/sysemu/kvm.h +++ b/include/sysemu/kvm.h @@ -201,10 +201,15 @@ typedef struct KVMCapabilityInfo { struct KVMState; +struct KVMClass { + AccelClass parent_class; + + int default_vm_type; +}; + #define TYPE_KVM_ACCEL ACCEL_CLASS_NAME("kvm") typedef struct KVMState KVMState; -DECLARE_INSTANCE_CHECKER(KVMState, KVM_STATE, - TYPE_KVM_ACCEL) +OBJECT_DECLARE_TYPE(KVMState, KVMClass, KVM_ACCEL) extern KVMState *kvm_state; typedef struct Notifier Notifier; diff --git a/accel/kvm/kvm-all.c b/accel/kvm/kvm-all.c index 373d876c05..fdd424e1a5 100644 --- a/accel/kvm/kvm-all.c +++ b/accel/kvm/kvm-all.c @@ -2458,12 +2458,13 @@ static int kvm_init(MachineState *ms) KVMState *s; const KVMCapabilityInfo *missing_cap; int ret; - int type = 0; + int type; uint64_t dirty_log_manual_caps; qemu_mutex_init(&kml_slots_lock); s = KVM_STATE(ms->accelerator); + type = KVM_GET_CLASS(s)->default_vm_type; /* * On systems where the kernel can support different base page diff --git a/target/arm/kvm.c b/target/arm/kvm.c index b4c7654f49..5c13594fdf 100644 --- a/target/arm/kvm.c +++ b/target/arm/kvm.c @@ -1064,4 +1064,8 @@ bool kvm_arch_cpu_check_are_resettable(void) void kvm_arch_accel_class_init(ObjectClass *oc) { + KVMClass *kc = KVM_CLASS(oc); + + /* Host_IPA_Limit ... */ + kc->default_vm_type = 32; } ---
On Fri, 21 Jul 2023 at 08:30, Philippe Mathieu-Daudé <philmd@linaro.org> wrote: > > Hi Akihiko, > > On 21/7/23 08:24, Akihiko Odaki wrote: > > libvirt uses "none" machine type to test KVM availability. Before this > > change, QEMU used to pass 0 as machine type when calling KVM_CREATE_VM. > > > > The kernel documentation says: > >> On arm64, the physical address size for a VM (IPA Size limit) is > >> limited to 40bits by default. The limit can be configured if the host > >> supports the extension KVM_CAP_ARM_VM_IPA_SIZE. When supported, use > >> KVM_VM_TYPE_ARM_IPA_SIZE(IPA_Bits) to set the size in the machine type > >> identifier, where IPA_Bits is the maximum width of any physical > >> address used by the VM. The IPA_Bits is encoded in bits[7-0] of the > >> machine type identifier. > >> > >> e.g, to configure a guest to use 48bit physical address size:: > >> > >> vm_fd = ioctl(dev_fd, KVM_CREATE_VM, KVM_VM_TYPE_ARM_IPA_SIZE(48)); > >> > >> The requested size (IPA_Bits) must be: > >> > >> == ========================================================= > >> 0 Implies default size, 40bits (for backward compatibility) > >> N Implies N bits, where N is a positive integer such that, > >> 32 <= N <= Host_IPA_Limit > >> == ========================================================= > > > >> Host_IPA_Limit is the maximum possible value for IPA_Bits on the host > >> and is dependent on the CPU capability and the kernel configuration. > >> The limit can be retrieved using KVM_CAP_ARM_VM_IPA_SIZE of the > >> KVM_CHECK_EXTENSION ioctl() at run-time. > >> > >> Creation of the VM will fail if the requested IPA size (whether it is > >> implicit or explicit) is unsupported on the host. > > https://docs.kernel.org/virt/kvm/api.html#kvm-create-vm > > > > So if Host_IPA_Limit < 40, such KVM_CREATE_VM will fail, and libvirt > > incorrectly thinks KVM is not available. This actually happened on M2 > > MacBook Air. > > > > Fix this by specifying 32 for IPA_Bits as any arm64 system should > > support the value according to the documentation. > > > > Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com> > > --- > > V1 -> V2: Introduced an arch hook > > > > include/sysemu/kvm.h | 1 + > > accel/kvm/kvm-all.c | 2 +- > > target/arm/kvm.c | 2 ++ > > target/i386/kvm/kvm.c | 2 ++ > > target/mips/kvm.c | 2 ++ > > target/ppc/kvm.c | 2 ++ > > target/riscv/kvm.c | 2 ++ > > target/s390x/kvm/kvm.c | 2 ++ > > 8 files changed, 14 insertions(+), 1 deletion(-) > > My understanding of Peter's suggestion would be smth like: > > -- >8 -- > diff --git a/include/sysemu/kvm.h b/include/sysemu/kvm.h > index 115f0cca79..c0af15eb6c 100644 > --- a/include/sysemu/kvm.h > +++ b/include/sysemu/kvm.h > @@ -201,10 +201,15 @@ typedef struct KVMCapabilityInfo { > > struct KVMState; > > +struct KVMClass { > + AccelClass parent_class; > + > + int default_vm_type; The kernel docs say you need to check for the KVM_CAP_ARM_VM_IPA_SIZE before you can pass something other than zero to the KVM_CREATE_VM ioctl, so this needs to be a method, not just a value. (kvm_arm_get_max_vm_ipa_size() will do this bit for you.) If the machine doesn't provide a kvm_type method, we should default to "largest the host supports", I think. I was wondering if we could have one per-arch method for "actually create the VM" that both was a place for arm to set the default vm type and also let us get the TARGET_S390X and TARGET_PPC ifdefs out of this bit of kvm-all.c, but maybe that would look just a bit too awkward: if (kc->create_vm(s, board_sets_kvm_type, board_kvm_type) < 0) { goto err; } where board_sets_kvm_type is a bool, true if board_kvm_type is valid, and board_kvm_type is whatever the board's mc->kvm_type method told us. (Default impl of the method: call KVM_CREATE_VM ioctl with retry-on-eintr, printing the simple error message; PPC and s390 versions similar but with their arch specific extra messages; arm version has a different default type if board_sets_kvm_type is false.) Not trying to do both of those things with one method would result in a simpler type = kc->get_default_kvm_type(s); API. thanks -- PMM
© 2016 - 2024 Red Hat, Inc.