The ITS is not properly reset at the moment. It is possible the
GITS_BASER<n>.valid is set and the in-kernel ITS caches are not
empty (list of devices, collections, LPIs) while data structures
in guest RAM are invalid/inconsistent.
For instance, this happens after a guest shutdown -r now or a
system reset, if we save the state before the guest re-writes
the ITS registers or map devices, the table save ioctl may
produce a QEMU abort.
Until there is a proper reset implemented, let's unplug the
consistency error checking.
The reset issue will be fixed in subsequent patches.
Signed-off-by: Eric Auger <eric.auger@redhat.com>
Reported-by: wanghaibin <wanghaibin.wang@huawei.com>
---
this patch would deserve being cc'ed stable (2.10)
---
hw/intc/arm_gicv3_its_kvm.c | 15 ++++++++-------
1 file changed, 8 insertions(+), 7 deletions(-)
diff --git a/hw/intc/arm_gicv3_its_kvm.c b/hw/intc/arm_gicv3_its_kvm.c
index 39903d5..120b86d 100644
--- a/hw/intc/arm_gicv3_its_kvm.c
+++ b/hw/intc/arm_gicv3_its_kvm.c
@@ -64,20 +64,16 @@ static void vm_change_state_handler(void *opaque, int running,
{
GICv3ITSState *s = (GICv3ITSState *)opaque;
Error *err = NULL;
- int ret;
if (running) {
return;
}
- ret = kvm_device_access(s->dev_fd, KVM_DEV_ARM_VGIC_GRP_CTRL,
- KVM_DEV_ARM_ITS_SAVE_TABLES, NULL, true, &err);
+ kvm_device_access(s->dev_fd, KVM_DEV_ARM_VGIC_GRP_CTRL,
+ KVM_DEV_ARM_ITS_SAVE_TABLES, NULL, true, &err);
if (err) {
error_report_err(err);
}
- if (ret < 0 && ret != -EFAULT) {
- abort();
- }
}
static void kvm_arm_its_realize(DeviceState *dev, Error **errp)
@@ -157,6 +153,7 @@ static void kvm_arm_its_pre_save(GICv3ITSState *s)
*/
static void kvm_arm_its_post_load(GICv3ITSState *s)
{
+ Error *err = NULL;
int i;
if (!s->iidr) {
@@ -188,7 +185,11 @@ static void kvm_arm_its_post_load(GICv3ITSState *s)
kvm_device_access(s->dev_fd, KVM_DEV_ARM_VGIC_GRP_CTRL,
KVM_DEV_ARM_ITS_RESTORE_TABLES, NULL, true,
- &error_abort);
+ &err);
+
+ if (err) {
+ error_report_err(err);
+ }
kvm_device_access(s->dev_fd, KVM_DEV_ARM_VGIC_GRP_ITS_REGS,
GITS_CTLR, &s->ctlr, true, &error_abort);
--
2.5.5
On 27 September 2017 at 15:56, Eric Auger <eric.auger@redhat.com> wrote: > The ITS is not properly reset at the moment. It is possible the > GITS_BASER<n>.valid is set and the in-kernel ITS caches are not > empty (list of devices, collections, LPIs) while data structures > in guest RAM are invalid/inconsistent. > > For instance, this happens after a guest shutdown -r now or a > system reset, if we save the state before the guest re-writes > the ITS registers or map devices, the table save ioctl may > produce a QEMU abort. > > Until there is a proper reset implemented, let's unplug the > consistency error checking. > > The reset issue will be fixed in subsequent patches. > > Signed-off-by: Eric Auger <eric.auger@redhat.com> > Reported-by: wanghaibin <wanghaibin.wang@huawei.com> When in particular does this cause an abort -- when we're trying to save the state in these edge cases, or when we're trying to restore it? What does the kernel do -- is it just rejecting the attempt, or might it actually have done bad things to guest memory ? thanks -- PMM
Hi Peter, On 12/10/2017 13:54, Peter Maydell wrote: > On 27 September 2017 at 15:56, Eric Auger <eric.auger@redhat.com> wrote: >> The ITS is not properly reset at the moment. It is possible the >> GITS_BASER<n>.valid is set and the in-kernel ITS caches are not >> empty (list of devices, collections, LPIs) while data structures >> in guest RAM are invalid/inconsistent. >> >> For instance, this happens after a guest shutdown -r now or a >> system reset, if we save the state before the guest re-writes >> the ITS registers or map devices, the table save ioctl may >> produce a QEMU abort. >> >> Until there is a proper reset implemented, let's unplug the >> consistency error checking. >> >> The reset issue will be fixed in subsequent patches. >> >> Signed-off-by: Eric Auger <eric.auger@redhat.com> >> Reported-by: wanghaibin <wanghaibin.wang@huawei.com> > > When in particular does this cause an abort -- when we're > trying to save the state in these edge cases, or when we're > trying to restore it? Both. After a guest reset, device GITS_BASER<n> may point to an L1 device table that is not valid anymore. In that case device table save IOTCL returned -EINVAL and we aborted. On restore we had a bug in case all data in the table is invalid. In that case as well we currently abort. What does the kernel do -- is it just > rejecting the attempt, or might it actually have done bad > things to guest memory ? On Save, in case the GITS_BASER points to an invalid linear device tree table but the page still corresponds to a visible gfn window, this memory could be overwritten by the kernel. Thanks Eric > > thanks > -- PMM >
© 2016 - 2026 Red Hat, Inc.