From nobody Thu Nov 6 08:25:33 2025 Delivered-To: importer@patchew.org Received-SPF: pass (zoho.com: domain of gnu.org designates 208.118.235.17 as permitted sender) client-ip=208.118.235.17; envelope-from=qemu-devel-bounces+importer=patchew.org@nongnu.org; helo=lists.gnu.org; Authentication-Results: mx.zohomail.com; dkim=fail; spf=pass (zoho.com: domain of gnu.org designates 208.118.235.17 as permitted sender) smtp.mailfrom=qemu-devel-bounces+importer=patchew.org@nongnu.org; dmarc=fail(p=none dis=none) header.from=nvidia.com Return-Path: Received: from lists.gnu.org (lists.gnu.org [208.118.235.17]) by mx.zohomail.com with SMTPS id 1539715211093238.53612672921804; Tue, 16 Oct 2018 11:40:11 -0700 (PDT) Received: from localhost ([::1]:59693 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gCUGH-0006UT-LE for importer@patchew.org; Tue, 16 Oct 2018 14:40:09 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:42125) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gCTwT-0006no-FM for qemu-devel@nongnu.org; Tue, 16 Oct 2018 14:19:44 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1gCTwP-0003rY-PS for qemu-devel@nongnu.org; Tue, 16 Oct 2018 14:19:41 -0400 Received: from hqemgate16.nvidia.com ([216.228.121.65]:12846) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1gCTwN-0003fh-Pe for qemu-devel@nongnu.org; Tue, 16 Oct 2018 14:19:37 -0400 Received: from hqpgpgate102.nvidia.com (Not Verified[216.228.121.13]) by hqemgate16.nvidia.com (using TLS: TLSv1.2, DES-CBC3-SHA) id ; Tue, 16 Oct 2018 11:14:27 -0700 Received: from HQMAIL104.nvidia.com ([172.20.161.6]) by hqpgpgate102.nvidia.com (PGP Universal service); Tue, 16 Oct 2018 11:14:23 -0700 Received: from HQMAIL106.nvidia.com (172.18.146.12) by HQMAIL104.nvidia.com (172.18.146.11) with Microsoft SMTP Server (TLS) id 15.0.1395.4; Tue, 16 Oct 2018 18:14:23 +0000 Received: from kwankhede-dev.nvidia.com (172.20.13.39) by HQMAIL106.nvidia.com (172.18.146.12) with Microsoft SMTP Server (TLS) id 15.0.1395.4 via Frontend Transport; Tue, 16 Oct 2018 18:14:21 +0000 X-PGP-Universal: processed; by hqpgpgate102.nvidia.com on Tue, 16 Oct 2018 11:14:23 -0700 From: Kirti Wankhede To: , Date: Tue, 16 Oct 2018 23:42:36 +0530 Message-ID: <1539713558-2453-3-git-send-email-kwankhede@nvidia.com> X-Mailer: git-send-email 2.7.0 In-Reply-To: <1539713558-2453-1-git-send-email-kwankhede@nvidia.com> References: <1539713558-2453-1-git-send-email-kwankhede@nvidia.com> X-NVConfidentiality: public MIME-Version: 1.0 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nvidia.com; s=n1; t=1539713667; bh=BY0F/ZbqwFwzg/fSTGqa5BBas0hGccE16ingupmBxSs=; h=X-PGP-Universal:From:To:CC:Subject:Date:Message-ID:X-Mailer: In-Reply-To:References:X-NVConfidentiality:MIME-Version: Content-Type; b=A2nic76IIYWqKFYaEyf/2wX6eZNLd/UsOJZ4u9uejvS870/gneMS5ikBFv7d+Pto/ HMXld3MIi0Ho5jdwTg4Zl0/OkRq0FcJrbLInFXSGpnw19Tkshgwxx8mkQU8St/7Y6C PxtZIDiQ3/mCh/BIZYrWz8eP0tfDMhi0mzFXPY0RNGV+gPppKzXLbg2I+zssYwDipo cZDnm3DJZHcBJcWYC6CxXFP/fH7Z+jrLSWUJoakSCi744KoGY3ihzzVemffViDTXg3 00zm3xc+OH4653iYp39Moeuxf0xYaF9C7/Ux9SSV2QeKiPBzp/CL6gOQuz52HDXRlY IQCa3Epo/FP2Q== X-detected-operating-system: by eggs.gnu.org: Windows 7 or 8 X-Received-From: 216.228.121.65 Subject: [Qemu-devel] [RFC PATCH v1 2/4] Add migration functions for VFIO devices X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Kirti Wankhede , qemu-devel@nongnu.org, kvm@vger.kernel.org Errors-To: qemu-devel-bounces+importer=patchew.org@nongnu.org Sender: "Qemu-devel" X-ZohoMail-DKIM: fail (Header signature does not verify) X-ZohoMail: RDMRC_1 RDKM_2 RSF_0 Z_629925259 SPT_0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" - Migration function are implemented for VFIO_DEVICE_TYPE_PCI device. - Added SaveVMHandlers and implemented all basic functions required for live migration. - Added VM state change handler to know running or stopped state of VM. - Added migration state change notifier to get notification on migration st= ate change. This state is translated to VFIO device state and conveyed to ven= dor driver. Signed-off-by: Kirti Wankhede Reviewed-by: Neo Jia --- hw/vfio/Makefile.objs | 2 +- hw/vfio/migration.c | 716 ++++++++++++++++++++++++++++++++++++++= ++++ include/hw/vfio/vfio-common.h | 23 ++ 3 files changed, 740 insertions(+), 1 deletion(-) create mode 100644 hw/vfio/migration.c diff --git a/hw/vfio/Makefile.objs b/hw/vfio/Makefile.objs index a2e7a0a7cf02..6206ad47e90e 100644 --- a/hw/vfio/Makefile.objs +++ b/hw/vfio/Makefile.objs @@ -1,6 +1,6 @@ ifeq ($(CONFIG_LINUX), y) obj-$(CONFIG_SOFTMMU) +=3D common.o -obj-$(CONFIG_PCI) +=3D pci.o pci-quirks.o display.o +obj-$(CONFIG_PCI) +=3D pci.o pci-quirks.o display.o migration.o obj-$(CONFIG_VFIO_CCW) +=3D ccw.o obj-$(CONFIG_SOFTMMU) +=3D platform.o obj-$(CONFIG_VFIO_XGMAC) +=3D calxeda-xgmac.o diff --git a/hw/vfio/migration.c b/hw/vfio/migration.c new file mode 100644 index 000000000000..8a4f515226e0 --- /dev/null +++ b/hw/vfio/migration.c @@ -0,0 +1,716 @@ +/* + * Migration support for VFIO devices + * + * Copyright NVIDIA, Inc. 2018 + * + * This work is licensed under the terms of the GNU GPL, version 2. See + * the COPYING file in the top-level directory. + */ + +#include "qemu/osdep.h" +#include +#include + +#include "hw/vfio/vfio-common.h" +#include "cpu.h" +#include "migration/migration.h" +#include "migration/qemu-file.h" +#include "migration/register.h" +#include "migration/blocker.h" +#include "migration/misc.h" +#include "qapi/error.h" +#include "exec/ramlist.h" +#include "exec/ram_addr.h" +#include "pci.h" + +/* + * Flags used as delimiter: + * 0xffffffff =3D> MSB 32-bit all 1s + * 0xef10 =3D> emulated (virtual) function IO + * 0x0000 =3D> 16-bits reserved for flags + */ +#define VFIO_MIG_FLAG_END_OF_STATE (0xffffffffef100001ULL) +#define VFIO_MIG_FLAG_DEV_CONFIG_STATE (0xffffffffef100002ULL) +#define VFIO_MIG_FLAG_DEV_SETUP_STATE (0xffffffffef100003ULL) + +static void vfio_migration_region_exit(VFIODevice *vbasedev) +{ + VFIOMigration *migration =3D vbasedev->migration; + + if (!migration) { + return; + } + + if (migration->region.buffer.size) { + vfio_region_exit(&migration->region.buffer); + vfio_region_finalize(&migration->region.buffer); + } + g_free(vbasedev->migration); +} + +static int vfio_migration_region_init(VFIODevice *vbasedev) +{ + VFIOMigration *migration; + Object *obj =3D NULL; + int ret; + struct vfio_device_migration_info migration_info =3D { + .argsz =3D sizeof(migration_info), + .flags =3D VFIO_MIGRATION_GET_REGION, + }; + + /* Migration support added for PCI device only */ + if (vbasedev->type =3D=3D VFIO_DEVICE_TYPE_PCI) { + VFIOPCIDevice *vdev =3D container_of(vbasedev, VFIOPCIDevice, vbas= edev); + + obj =3D OBJECT(vdev); + } else + return -EINVAL; + + ret =3D ioctl(vbasedev->fd, VFIO_DEVICE_MIGRATION_INFO, &migration_inf= o); + if (ret < 0) { + error_report("Failed to migration region %s", + strerror(errno)); + return ret; + } + + if (!migration_info.size || !migration_info.region_index) { + error_report("Incorrect migration region params index: %d,size: 0x= %llx", + migration_info.region_index, migration_info.size); + return -EINVAL; + } + + vbasedev->migration =3D g_new0(VFIOMigration, 1); + migration =3D vbasedev->migration; + + migration->region.index =3D migration_info.region_index; + + ret =3D vfio_region_setup(obj, vbasedev, + &migration->region.buffer, + migration_info.region_index, + "migration"); + if (ret !=3D 0) { + error_report("%s: vfio_region_setup(%d): %s", + __func__, migration_info.region_index, strerror(-ret)); + goto err; + } + + if (migration->region.buffer.mmaps =3D=3D NULL) { + ret =3D -EINVAL; + error_report("%s: Migration region (%d) not mappable : %s", + __func__, migration_info.region_index, strerror(-ret)); + goto err; + } + + ret =3D vfio_region_mmap(&migration->region.buffer); + if (ret !=3D 0) { + error_report("%s: vfio_region_mmap(%d): %s", __func__, + migration_info.region_index, strerror(-ret)); + goto err; + } + assert(migration->region.buffer.mmaps[0].mmap !=3D NULL); + + return 0; + +err: + vfio_migration_region_exit(vbasedev); + return ret; +} + +static int vfio_migration_set_state(VFIODevice *vbasedev, uint32_t state) +{ + int ret =3D 0; + struct vfio_device_migration_info migration_info =3D { + .argsz =3D sizeof(migration_info), + .flags =3D VFIO_MIGRATION_SET_STATE, + .device_state =3D state, + }; + + if (vbasedev->device_state =3D=3D state) { + return ret; + } + + ret =3D ioctl(vbasedev->fd, VFIO_DEVICE_MIGRATION_INFO, &migration_inf= o); + if (ret < 0) { + error_report("Failed to set migration state %d %s", + ret, strerror(errno)); + return ret; + } + + vbasedev->device_state =3D state; + return ret; +} + +void vfio_get_dirty_page_list(VFIODevice *vbasedev, + uint64_t start_addr, + uint64_t pfn_count) +{ + uint64_t count =3D 0; + int ret; + struct vfio_device_migration_info *migration_info; + uint64_t bitmap_size; + + bitmap_size =3D (BITS_TO_LONGS(pfn_count) + 1) * sizeof(unsigned long); + + migration_info =3D g_malloc0(sizeof(*migration_info) + bitmap_size); + if (!migration_info) { + error_report("Failed to allocated migration_info %s", + strerror(errno)); + return; + } + + memset(migration_info, 0, sizeof(*migration_info) + bitmap_size); + migration_info->flags =3D VFIO_MIGRATION_GET_DIRTY_PFNS, + migration_info->start_addr =3D start_addr; + migration_info->pfn_count =3D pfn_count; + migration_info->argsz =3D sizeof(*migration_info) + bitmap_size; + + ret =3D ioctl(vbasedev->fd, VFIO_DEVICE_MIGRATION_INFO, migration_info= ); + if (ret < 0) { + error_report("Failed to get dirty pages bitmap %d %s", + ret, strerror(errno)); + g_free(migration_info); + return; + } + + if (migration_info->pfn_count) { + cpu_physical_memory_set_dirty_lebitmap( + (unsigned long *)&migration_info->dirty_bitmap, + migration_info->start_addr, migration_info->pfn_count); + count +=3D migration_info->pfn_count; + } + g_free(migration_info); +} + +static int vfio_save_device_config_state(QEMUFile *f, void *opaque) +{ + VFIODevice *vbasedev =3D opaque; + + qemu_put_be64(f, VFIO_MIG_FLAG_DEV_CONFIG_STATE); + + if (vbasedev->type =3D=3D VFIO_DEVICE_TYPE_PCI) { + VFIOPCIDevice *vdev =3D container_of(vbasedev, VFIOPCIDevice, vbas= edev); + PCIDevice *pdev =3D &vdev->pdev; + uint32_t msi_flags, msi_addr_lo, msi_addr_hi =3D 0, msi_data; + bool msi_64bit; + int i; + + for (i =3D 0; i < PCI_ROM_SLOT; i++) { + uint32_t bar; + + bar =3D pci_default_read_config(pdev, PCI_BASE_ADDRESS_0 + i *= 4, 4); + qemu_put_be32(f, bar); + } + + msi_flags =3D pci_default_read_config(pdev, + pdev->msi_cap + PCI_MSI_FLAGS,= 2); + msi_64bit =3D (msi_flags & PCI_MSI_FLAGS_64BIT); + + msi_addr_lo =3D pci_default_read_config(pdev, + pdev->msi_cap + PCI_MSI_ADDRESS_L= O, 4); + qemu_put_be32(f, msi_addr_lo); + + if (msi_64bit) { + msi_addr_hi =3D pci_default_read_config(pdev, + pdev->msi_cap + PCI_MSI_ADDRES= S_HI, + 4); + } + qemu_put_be32(f, msi_addr_hi); + + msi_data =3D pci_default_read_config(pdev, + pdev->msi_cap + (msi_64bit ? PCI_MSI_DATA_64 : PCI_MSI_DAT= A_32), + 2); + qemu_put_be32(f, msi_data); + } + qemu_put_be64(f, VFIO_MIG_FLAG_END_OF_STATE); + + return qemu_file_get_error(f); +} + +static int vfio_load_device_config_state(QEMUFile *f, void *opaque) +{ + VFIODevice *vbasedev =3D opaque; + + if (vbasedev->type =3D=3D VFIO_DEVICE_TYPE_PCI) { + VFIOPCIDevice *vdev =3D container_of(vbasedev, VFIOPCIDevice, vbas= edev); + PCIDevice *pdev =3D &vdev->pdev; + uint32_t pci_cmd; + uint32_t msi_flags, msi_addr_lo, msi_addr_hi =3D 0, msi_data; + bool msi_64bit; + int i; + + /* retore pci bar configuration */ + pci_cmd =3D pci_default_read_config(pdev, PCI_COMMAND, 2); + vfio_pci_write_config(pdev, PCI_COMMAND, + pci_cmd & (!(PCI_COMMAND_IO | PCI_COMMAND_MEMORY)= ), 2); + for (i =3D 0; i < PCI_ROM_SLOT; i++) { + uint32_t bar =3D qemu_get_be32(f); + + vfio_pci_write_config(pdev, PCI_BASE_ADDRESS_0 + i * 4, bar, 4= ); + } + vfio_pci_write_config(pdev, PCI_COMMAND, + pci_cmd | PCI_COMMAND_IO | PCI_COMMAND_MEMOR= Y, 2); + + /* restore msi configuration */ + msi_flags =3D pci_default_read_config(pdev, + pdev->msi_cap + PCI_MSI_FLAGS, + 2); + msi_64bit =3D (msi_flags & PCI_MSI_FLAGS_64BIT); + + vfio_pci_write_config(&vdev->pdev, + pdev->msi_cap + PCI_MSI_FLAGS, + msi_flags & (!PCI_MSI_FLAGS_ENABLE), + 2); + + msi_addr_lo =3D qemu_get_be32(f); + vfio_pci_write_config(pdev, + pdev->msi_cap + PCI_MSI_ADDRESS_LO, + msi_addr_lo, + 4); + + msi_addr_hi =3D qemu_get_be32(f); + if (msi_64bit) { + vfio_pci_write_config(pdev, pdev->msi_cap + PCI_MSI_ADDRESS_HI, + msi_addr_hi, 4); + } + msi_data =3D qemu_get_be32(f); + vfio_pci_write_config(pdev, + pdev->msi_cap + (msi_64bit ? PCI_MSI_DATA_64= : + PCI_MSI_DATA_32= ), + msi_data, + 2); + + vfio_pci_write_config(&vdev->pdev, + pdev->msi_cap + PCI_MSI_FLAGS, + msi_flags | PCI_MSI_FLAGS_ENABLE, + 2); + } + + if (qemu_get_be64(f) !=3D VFIO_MIG_FLAG_END_OF_STATE) { + error_report("%s Wrong end of block ", __func__); + return -EINVAL; + } + + return qemu_file_get_error(f); +} + +/* ---------------------------------------------------------------------- = */ + +static bool vfio_is_active_iterate(void *opaque) +{ + VFIODevice *vbasedev =3D opaque; + + if (vbasedev->vm_running && vbasedev->migration && + (vbasedev->migration->pending_precopy_only !=3D 0)) + return true; + + if (!vbasedev->vm_running && vbasedev->migration && + (vbasedev->migration->pending_postcopy !=3D 0)) + return true; + + return false; +} + +static int vfio_save_setup(QEMUFile *f, void *opaque) +{ + VFIODevice *vbasedev =3D opaque; + int ret; + + qemu_put_be64(f, VFIO_MIG_FLAG_DEV_SETUP_STATE); + + qemu_mutex_lock_iothread(); + ret =3D vfio_migration_region_init(vbasedev); + qemu_mutex_unlock_iothread(); + if (ret) { + return ret; + } + + qemu_put_be64(f, VFIO_MIG_FLAG_END_OF_STATE); + + ret =3D qemu_file_get_error(f); + if (ret) { + return ret; + } + + return 0; +} + +static int vfio_save_buffer(QEMUFile *f, VFIODevice *vbasedev) +{ + VFIOMigration *migration =3D vbasedev->migration; + uint8_t *buf =3D (uint8_t *)migration->region.buffer.mmaps[0].mmap; + int ret; + struct vfio_device_migration_info migration_info =3D { + .argsz =3D sizeof(migration_info), + .flags =3D VFIO_MIGRATION_GET_BUFFER, + }; + + ret =3D ioctl(vbasedev->fd, VFIO_DEVICE_MIGRATION_INFO, &migration_inf= o); + if (ret < 0) { + error_report("Failed to get migration buffer information %s", + strerror(errno)); + return ret; + } + + qemu_put_be64(f, migration_info.bytes_written); + + if (migration_info.bytes_written) { + qemu_put_buffer(f, buf, migration_info.bytes_written); + } + + ret =3D qemu_file_get_error(f); + if (ret) { + return ret; + } + + return migration_info.bytes_written; +} + +static int vfio_save_iterate(QEMUFile *f, void *opaque) +{ + VFIODevice *vbasedev =3D opaque; + int ret; + + ret =3D vfio_save_buffer(f, vbasedev); + if (ret < 0) { + error_report("vfio_save_buffer failed %s", + strerror(errno)); + return ret; + } + + qemu_put_be64(f, VFIO_MIG_FLAG_END_OF_STATE); + + ret =3D qemu_file_get_error(f); + if (ret) { + return ret; + } + + return ret; +} + +static void vfio_update_pending(VFIODevice *vbasedev, uint64_t threshold_s= ize) +{ + struct vfio_device_migration_info migration_info; + VFIOMigration *migration =3D vbasedev->migration; + int ret; + + migration_info.argsz =3D sizeof(migration_info); + migration_info.flags =3D VFIO_MIGRATION_GET_PENDING; + migration_info.threshold_size =3D threshold_size; + + ret =3D ioctl(vbasedev->fd, VFIO_DEVICE_MIGRATION_INFO, &migration_inf= o); + if (ret < 0) { + error_report("Failed to get pending bytes %s", + strerror(errno)); + return; + } + + migration->pending_precopy_only =3D migration_info.pending_precopy_onl= y; + migration->pending_compatible =3D migration_info.pending_compatible; + migration->pending_postcopy =3D migration_info.pending_postcopy_only; + + return; +} + +static void vfio_save_pending(QEMUFile *f, void *opaque, + uint64_t threshold_size, + uint64_t *res_precopy_only, + uint64_t *res_compatible, + uint64_t *res_postcopy_only) +{ + VFIODevice *vbasedev =3D opaque; + VFIOMigration *migration =3D vbasedev->migration; + + vfio_update_pending(vbasedev, threshold_size); + + *res_precopy_only +=3D migration->pending_precopy_only; + *res_compatible +=3D migration->pending_compatible; + *res_postcopy_only +=3D migration->pending_postcopy; +} + +static int vfio_save_complete_precopy(QEMUFile *f, void *opaque) +{ + VFIODevice *vbasedev =3D opaque; + VFIOMigration *migration =3D vbasedev->migration; + MigrationState *ms =3D migrate_get_current(); + int ret; + + if (vbasedev->vm_running) { + vbasedev->vm_running =3D 0; + } + + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_STOPNCOPY_ACT= IVE); + if (ret) { + error_report("Failed to set state STOPNCOPY_ACTIVE"); + return ret; + } + + ret =3D vfio_save_device_config_state(f, opaque); + if (ret) { + return ret; + } + + do { + vfio_update_pending(vbasedev, ms->threshold_size); + + if (vfio_is_active_iterate(opaque)) { + ret =3D vfio_save_buffer(f, vbasedev); + if (ret < 0) { + error_report("Failed to save buffer"); + break; + } else if (ret =3D=3D 0) { + break; + } + } + } while ((migration->pending_compatible + migration->pending_postcopy)= > 0); + + qemu_put_be64(f, VFIO_MIG_FLAG_END_OF_STATE); + + ret =3D qemu_file_get_error(f); + if (ret) { + return ret; + } + + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_SAVE_COMPLE= TED); + if (ret) { + error_report("Failed to set state SAVE_COMPLETED"); + return ret; + } + return ret; +} + +static void vfio_save_cleanup(void *opaque) +{ + VFIODevice *vbasedev =3D opaque; + + vfio_migration_region_exit(vbasedev); +} + +static int vfio_load_state(QEMUFile *f, void *opaque, int version_id) +{ + VFIODevice *vbasedev =3D opaque; + VFIOMigration *migration =3D vbasedev->migration; + uint8_t *buf =3D (uint8_t *)migration->region.buffer.mmaps[0].mmap; + int ret; + uint64_t data; + + data =3D qemu_get_be64(f); + while (data !=3D VFIO_MIG_FLAG_END_OF_STATE) { + if (data =3D=3D VFIO_MIG_FLAG_DEV_CONFIG_STATE) { + ret =3D vfio_load_device_config_state(f, opaque); + if (ret) { + return ret; + } + } else if (data =3D=3D VFIO_MIG_FLAG_DEV_SETUP_STATE) { + data =3D qemu_get_be64(f); + if (data =3D=3D VFIO_MIG_FLAG_END_OF_STATE) { + return 0; + } else { + error_report("SETUP STATE: EOS not found 0x%lx", data); + return -EINVAL; + } + } else if (data !=3D 0) { + struct vfio_device_migration_info migration_info =3D { + .argsz =3D sizeof(migration_info), + .flags =3D VFIO_MIGRATION_SET_BUFFER, + }; + + qemu_get_buffer(f, buf, data); + migration_info.bytes_written =3D data; + + ret =3D ioctl(vbasedev->fd, + VFIO_DEVICE_MIGRATION_INFO, + &migration_info); + if (ret < 0) { + error_report("Failed to set migration buffer information %= s", + strerror(errno)); + return ret; + } + } + + ret =3D qemu_file_get_error(f); + if (ret) { + return ret; + } + data =3D qemu_get_be64(f); + } + + return 0; +} + +static int vfio_load_setup(QEMUFile *f, void *opaque) +{ + VFIODevice *vbasedev =3D opaque; + int ret; + + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_RESUME); + if (ret) { + error_report("Failed to set state RESUME"); + } + + ret =3D vfio_migration_region_init(vbasedev); + if (ret) { + error_report("Failed to initialise migration region"); + return ret; + } + + return 0; +} + +static int vfio_load_cleanup(void *opaque) +{ + VFIODevice *vbasedev =3D opaque; + int ret =3D 0; + + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_RESUME_COMPLE= TED); + if (ret) { + error_report("Failed to set state RESUME_COMPLETED"); + } + + vfio_migration_region_exit(vbasedev); + return ret; +} + +static SaveVMHandlers savevm_vfio_handlers =3D { + .save_setup =3D vfio_save_setup, + .save_live_iterate =3D vfio_save_iterate, + .save_live_complete_precopy =3D vfio_save_complete_precopy, + .save_live_pending =3D vfio_save_pending, + .save_cleanup =3D vfio_save_cleanup, + .load_state =3D vfio_load_state, + .load_setup =3D vfio_load_setup, + .load_cleanup =3D vfio_load_cleanup, + .is_active_iterate =3D vfio_is_active_iterate, +}; + +static void vfio_vmstate_change(void *opaque, int running, RunState state) +{ + VFIODevice *vbasedev =3D opaque; + + if ((vbasedev->vm_running !=3D running) && running) { + int ret; + + ret =3D vfio_migration_set_state(vbasedev, VFIO_DEVICE_STATE_RUNNI= NG); + if (ret) { + error_report("Failed to set state RUNNING"); + } + } + + vbasedev->vm_running =3D running; +} + +static void vfio_migration_state_notifier(Notifier *notifier, void *data) +{ + MigrationState *s =3D data; + VFIODevice *vbasedev =3D container_of(notifier, VFIODevice, migration_= state); + int ret; + + switch (s->state) { + case MIGRATION_STATUS_SETUP: + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_SETUP); + if (ret) { + error_report("Failed to set state SETUP"); + } + return; + + case MIGRATION_STATUS_ACTIVE: + if (vbasedev->device_state =3D=3D VFIO_DEVICE_STATE_MIGRATION_SETU= P) { + if (vbasedev->vm_running) { + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_PRECOPY_AC= TIVE); + if (ret) { + error_report("Failed to set state PRECOPY_ACTIVE"); + } + } else { + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_STOPNCOPY_ACT= IVE); + if (ret) { + error_report("Failed to set state STOPNCOPY_ACTIVE"); + } + } + } else { + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_RES= UME); + if (ret) { + error_report("Failed to set state RESUME"); + } + } + return; + + case MIGRATION_STATUS_CANCELLING: + case MIGRATION_STATUS_CANCELLED: + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_CANCELL= ED); + if (ret) { + error_report("Failed to set state CANCELLED"); + } + return; + + case MIGRATION_STATUS_FAILED: + ret =3D vfio_migration_set_state(vbasedev, + VFIO_DEVICE_STATE_MIGRATION_FAILED); + if (ret) { + error_report("Failed to set state FAILED"); + } + return; + } +} + +static int vfio_migration_init(VFIODevice *vbasedev) +{ + register_savevm_live(NULL, "vfio", -1, 1, &savevm_vfio_handlers, vbase= dev); + vbasedev->vm_state =3D qemu_add_vm_change_state_handler(vfio_vmstate_c= hange, + vbasedev); + + vbasedev->migration_state.notify =3D vfio_migration_state_notifier; + add_migration_state_change_notifier(&vbasedev->migration_state); + + return 0; +} + + +/* ---------------------------------------------------------------------- = */ + +int vfio_migration_probe(VFIODevice *vbasedev, Error **errp) +{ + struct vfio_device_migration_info probe; + Error *local_err =3D NULL; + int ret; + + memset(&probe, 0, sizeof(probe)); + probe.argsz =3D sizeof(probe); + probe.flags =3D VFIO_MIGRATION_PROBE; + ret =3D ioctl(vbasedev->fd, VFIO_DEVICE_MIGRATION_INFO, &probe); + + if (ret =3D=3D 0) { + return vfio_migration_init(vbasedev); + } + + error_setg(&vbasedev->migration_blocker, + "VFIO device doesn't support migration"); + ret =3D migrate_add_blocker(vbasedev->migration_blocker, &local_err); + if (local_err) { + error_propagate(errp, local_err); + error_free(vbasedev->migration_blocker); + return ret; + } + + return 0; +} + +void vfio_migration_finalize(VFIODevice *vbasedev) +{ + if (vbasedev->vm_state) { + qemu_del_vm_change_state_handler(vbasedev->vm_state); + remove_migration_state_change_notifier(&vbasedev->migration_state); + } + + if (vbasedev->migration_blocker) { + migrate_del_blocker(vbasedev->migration_blocker); + error_free(vbasedev->migration_blocker); + } +} diff --git a/include/hw/vfio/vfio-common.h b/include/hw/vfio/vfio-common.h index a9036929b220..ab8217c9e249 100644 --- a/include/hw/vfio/vfio-common.h +++ b/include/hw/vfio/vfio-common.h @@ -30,6 +30,8 @@ #include #endif =20 +#include "sysemu/sysemu.h" + #define ERR_PREFIX "vfio error: %s: " #define WARN_PREFIX "vfio warning: %s: " =20 @@ -57,6 +59,16 @@ typedef struct VFIORegion { uint8_t nr; /* cache the region number for debug */ } VFIORegion; =20 +typedef struct VFIOMigration { + struct { + VFIORegion buffer; + uint32_t index; + } region; + uint64_t pending_precopy_only; + uint64_t pending_compatible; + uint64_t pending_postcopy; +} VFIOMigration; + typedef struct VFIOAddressSpace { AddressSpace *as; QLIST_HEAD(, VFIOContainer) containers; @@ -116,6 +128,12 @@ typedef struct VFIODevice { unsigned int num_irqs; unsigned int num_regions; unsigned int flags; + uint32_t device_state; + VMChangeStateEntry *vm_state; + int vm_running; + Notifier migration_state; + VFIOMigration *migration; + Error *migration_blocker; } VFIODevice; =20 struct VFIODeviceOps { @@ -193,4 +211,9 @@ int vfio_spapr_create_window(VFIOContainer *container, int vfio_spapr_remove_window(VFIOContainer *container, hwaddr offset_within_address_space); =20 +int vfio_migration_probe(VFIODevice *vbasedev, Error **errp); +void vfio_migration_finalize(VFIODevice *vbasedev); +void vfio_get_dirty_page_list(VFIODevice *vbasedev, uint64_t start_addr, + uint64_t pfn_count); + #endif /* HW_VFIO_VFIO_COMMON_H */ --=20 2.7.0