[PATCH] Add page_per_vq flag to the 'driver' element of virtio devices

Gavi Teitz posted 1 patch 2 years, 11 months ago
Test syntax-check failed
Patches applied successfully (tree, apply log)
git fetch https://github.com/patchew-project/libvirt tags/patchew/1619694764-5190-1-git-send-email-gavi@nvidia.com
There is a newer version of this series
docs/formatdomain.rst                              | 11 +++++-
docs/schemas/domaincommon.rng                      |  5 +++
src/conf/domain_conf.c                             | 15 ++++++++
src/conf/domain_conf.h                             |  1 +
src/qemu/qemu_capabilities.c                       |  2 +
src/qemu/qemu_capabilities.h                       |  1 +
src/qemu/qemu_command.c                            |  5 +++
src/qemu/qemu_hotplug.c                            |  1 +
tests/qemuxml2argvdata/net-virtio-page-per-vq.args | 31 ++++++++++++++++
tests/qemuxml2argvdata/net-virtio-page-per-vq.xml  | 29 +++++++++++++++
tests/qemuxml2argvtest.c                           |  2 +
.../qemuxml2xmloutdata/net-virtio-page-per-vq.xml  | 43 ++++++++++++++++++++++
tests/qemuxml2xmltest.c                            |  2 +
13 files changed, 147 insertions(+), 1 deletion(-)
create mode 100644 tests/qemuxml2argvdata/net-virtio-page-per-vq.args
create mode 100644 tests/qemuxml2argvdata/net-virtio-page-per-vq.xml
create mode 100644 tests/qemuxml2xmloutdata/net-virtio-page-per-vq.xml
[PATCH] Add page_per_vq flag to the 'driver' element of virtio devices
Posted by Gavi Teitz 2 years, 11 months ago
https://bugzilla.redhat.com/show_bug.cgi?id=1925363

Add support for setting the page-per-vq flag, which is important for
vdpa with vhost-user performance.

Signed-off-by: Gavi Teitz <gavi@nvidia.com>
---
 docs/formatdomain.rst                              | 11 +++++-
 docs/schemas/domaincommon.rng                      |  5 +++
 src/conf/domain_conf.c                             | 15 ++++++++
 src/conf/domain_conf.h                             |  1 +
 src/qemu/qemu_capabilities.c                       |  2 +
 src/qemu/qemu_capabilities.h                       |  1 +
 src/qemu/qemu_command.c                            |  5 +++
 src/qemu/qemu_hotplug.c                            |  1 +
 tests/qemuxml2argvdata/net-virtio-page-per-vq.args | 31 ++++++++++++++++
 tests/qemuxml2argvdata/net-virtio-page-per-vq.xml  | 29 +++++++++++++++
 tests/qemuxml2argvtest.c                           |  2 +
 .../qemuxml2xmloutdata/net-virtio-page-per-vq.xml  | 43 ++++++++++++++++++++++
 tests/qemuxml2xmltest.c                            |  2 +
 13 files changed, 147 insertions(+), 1 deletion(-)
 create mode 100644 tests/qemuxml2argvdata/net-virtio-page-per-vq.args
 create mode 100644 tests/qemuxml2argvdata/net-virtio-page-per-vq.xml
 create mode 100644 tests/qemuxml2xmloutdata/net-virtio-page-per-vq.xml

diff --git a/docs/formatdomain.rst b/docs/formatdomain.rst
index 282176c..f5a9c70 100644
--- a/docs/formatdomain.rst
+++ b/docs/formatdomain.rst
@@ -5120,7 +5120,7 @@ Setting NIC driver-specific options
        <source network='default'/>
        <target dev='vnet1'/>
        <model type='virtio'/>
-       <driver name='vhost' txmode='iothread' ioeventfd='on' event_idx='off' queues='5' rx_queue_size='256' tx_queue_size='256'>
+       <driver name='vhost' txmode='iothread' ioeventfd='on' event_idx='off' queues='5' rx_queue_size='256' tx_queue_size='256' page_per_vq='on'>
          <host csum='off' gso='off' tso4='off' tso6='off' ecn='off' ufo='off' mrg_rxbuf='off'/>
          <guest csum='off' tso4='off' tso6='off' ecn='off' ufo='off'/>
        </driver>
@@ -5215,6 +5215,15 @@ following attributes are available for the ``"virtio"`` NIC driver:
    only for ``vhostuser`` type. :since:`Since 3.7.0 (QEMU and KVM only)`
    **In general you should leave this option alone, unless you are very certain
    you know what you are doing.**
+``page_per_vq``
+   This optional attribute controls the layout of the notification capabilities
+   exposed to the guest. When enabled, each virtio queue will have a dedicated
+   page on the device BAR exposed to the guest. It is recommended to be used when
+   vDPA is enabled on the hypervisor, as it enables mapping the notification area
+   to the physical device, which is only supported in page granularity. The
+   default is determined by QEMU; as off. :since:`Since 2.8.0 (QEMU only)`
+   **In general you should leave this option alone, unless you are very certain
+   you know what you are doing.**
 virtio options
    For virtio interfaces, `Virtio-specific options <#elementsVirtio>`__ can also
    be set. ( :since:`Since 3.5.0` )
diff --git a/docs/schemas/domaincommon.rng b/docs/schemas/domaincommon.rng
index a2e5c50..e61ad67 100644
--- a/docs/schemas/domaincommon.rng
+++ b/docs/schemas/domaincommon.rng
@@ -3463,6 +3463,11 @@
                 </attribute>
               </optional>
               <optional>
+                <attribute name="page_per_vq">
+                  <ref name="virOnOff"/>
+                </attribute>
+              </optional>
+              <optional>
                 <attribute name="txmode">
                   <choice>
                     <value>iothread</value>
diff --git a/src/conf/domain_conf.c b/src/conf/domain_conf.c
index 9d98f48..0350fde 100644
--- a/src/conf/domain_conf.c
+++ b/src/conf/domain_conf.c
@@ -10446,6 +10446,7 @@ virDomainNetDefParseXML(virDomainXMLOption *xmlopt,
     g_autofree char *queues = NULL;
     g_autofree char *rx_queue_size = NULL;
     g_autofree char *tx_queue_size = NULL;
+    g_autofree char *page_per_vq = NULL;
     g_autofree char *filter = NULL;
     g_autofree char *internal = NULL;
     g_autofree char *mode = NULL;
@@ -10615,6 +10616,7 @@ virDomainNetDefParseXML(virDomainXMLOption *xmlopt,
                 queues = virXMLPropString(cur, "queues");
                 rx_queue_size = virXMLPropString(cur, "rx_queue_size");
                 tx_queue_size = virXMLPropString(cur, "tx_queue_size");
+                page_per_vq = virXMLPropString(cur, "page_per_vq");
 
                 if (virDomainVirtioOptionsParseXML(cur, &def->virtio) < 0)
                     goto error;
@@ -11041,6 +11043,15 @@ virDomainNetDefParseXML(virDomainXMLOption *xmlopt,
             }
             def->driver.virtio.tx_queue_size = q;
         }
+        if (page_per_vq) {
+            if ((val = virTristateSwitchTypeFromString(page_per_vq)) <= 0) {
+                virReportError(VIR_ERR_CONFIG_UNSUPPORTED,
+                               _("unknown page_per_vq mode '%s'"),
+                               page_per_vq);
+                goto error;
+            }
+            def->driver.virtio.page_per_vq = val;
+        }
 
         if ((tmpNode = virXPathNode("./driver/host", ctxt))) {
             if (virXMLPropTristateSwitch(tmpNode, "csum", VIR_XML_PROP_NONE,
@@ -25453,6 +25464,10 @@ virDomainVirtioNetDriverFormat(virBuffer *buf,
     if (def->driver.virtio.tx_queue_size)
         virBufferAsprintf(buf, " tx_queue_size='%u'",
                           def->driver.virtio.tx_queue_size);
+    if (def->driver.virtio.page_per_vq) {
+        virBufferAsprintf(buf, " page_per_vq='%s'",
+                          virTristateSwitchTypeToString(def->driver.virtio.page_per_vq));
+    }
 
     virDomainVirtioOptionsFormat(buf, def->virtio);
 }
diff --git a/src/conf/domain_conf.h b/src/conf/domain_conf.h
index 85c318d..7aab565 100644
--- a/src/conf/domain_conf.h
+++ b/src/conf/domain_conf.h
@@ -1027,6 +1027,7 @@ struct _virDomainNetDef {
             virDomainNetVirtioTxModeType txmode;
             virTristateSwitch ioeventfd;
             virTristateSwitch event_idx;
+            virTristateSwitch page_per_vq;
             unsigned int queues; /* Multiqueue virtio-net */
             unsigned int rx_queue_size;
             unsigned int tx_queue_size;
diff --git a/src/qemu/qemu_capabilities.c b/src/qemu/qemu_capabilities.c
index 7971a9c..e5646cb 100644
--- a/src/qemu/qemu_capabilities.c
+++ b/src/qemu/qemu_capabilities.c
@@ -629,6 +629,7 @@ VIR_ENUM_IMPL(virQEMUCaps,
               /* 400 */
               "compat-deprecated",
               "acpi-index",
+              "virtio-net.page_per_vq",
     );
 
 
@@ -1405,6 +1406,7 @@ static struct virQEMUCapsDevicePropsFlags virQEMUCapsDevicePropsVirtioNet[] = {
     { "event_idx", QEMU_CAPS_VIRTIO_NET_EVENT_IDX, NULL },
     { "rx_queue_size", QEMU_CAPS_VIRTIO_NET_RX_QUEUE_SIZE, NULL },
     { "tx_queue_size", QEMU_CAPS_VIRTIO_NET_TX_QUEUE_SIZE, NULL },
+    { "page_per_vq", QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ, NULL },
     { "host_mtu", QEMU_CAPS_VIRTIO_NET_HOST_MTU, NULL },
     { "disable-legacy", QEMU_CAPS_VIRTIO_PCI_DISABLE_LEGACY, NULL },
     { "iommu_platform", QEMU_CAPS_VIRTIO_PCI_IOMMU_PLATFORM, NULL },
diff --git a/src/qemu/qemu_capabilities.h b/src/qemu/qemu_capabilities.h
index f54aad5..75feed4 100644
--- a/src/qemu/qemu_capabilities.h
+++ b/src/qemu/qemu_capabilities.h
@@ -609,6 +609,7 @@ typedef enum { /* virQEMUCapsFlags grouping marker for syntax-check */
     /* 400 */
     QEMU_CAPS_COMPAT_DEPRECATED, /* -compat deprecated-(input|output) is supported */
     QEMU_CAPS_ACPI_INDEX, /* PCI device 'acpi-index' property */
+    QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ, /* virtio-net-*.page_per_vq */
 
     QEMU_CAPS_LAST /* this must always be the last item */
 } virQEMUCapsFlags;
diff --git a/src/qemu/qemu_command.c b/src/qemu/qemu_command.c
index d7f1c71..7b5834f 100644
--- a/src/qemu/qemu_command.c
+++ b/src/qemu/qemu_command.c
@@ -3630,6 +3630,11 @@ qemuBuildNicDevStr(virDomainDef *def,
         if (net->driver.virtio.tx_queue_size)
             virBufferAsprintf(&buf, ",tx_queue_size=%u", net->driver.virtio.tx_queue_size);
 
+        if (net->driver.virtio.page_per_vq) {
+            virBufferAsprintf(&buf, ",page-per-vq=%s",
+                              virTristateSwitchTypeToString(net->driver.virtio.page_per_vq));
+        }
+
         if (net->mtu)
             virBufferAsprintf(&buf, ",host_mtu=%u", net->mtu);
 
diff --git a/src/qemu/qemu_hotplug.c b/src/qemu/qemu_hotplug.c
index 4344edc..f79c3d8 100644
--- a/src/qemu/qemu_hotplug.c
+++ b/src/qemu/qemu_hotplug.c
@@ -3578,6 +3578,7 @@ qemuDomainChangeNet(virQEMUDriver *driver,
          olddev->driver.virtio.queues != newdev->driver.virtio.queues ||
          olddev->driver.virtio.rx_queue_size != newdev->driver.virtio.rx_queue_size ||
          olddev->driver.virtio.tx_queue_size != newdev->driver.virtio.tx_queue_size ||
+         olddev->driver.virtio.page_per_vq != newdev->driver.virtio.page_per_vq ||
          olddev->driver.virtio.host.csum != newdev->driver.virtio.host.csum ||
          olddev->driver.virtio.host.gso != newdev->driver.virtio.host.gso ||
          olddev->driver.virtio.host.tso4 != newdev->driver.virtio.host.tso4 ||
diff --git a/tests/qemuxml2argvdata/net-virtio-page-per-vq.args b/tests/qemuxml2argvdata/net-virtio-page-per-vq.args
new file mode 100644
index 0000000..d7db71f
--- /dev/null
+++ b/tests/qemuxml2argvdata/net-virtio-page-per-vq.args
@@ -0,0 +1,31 @@
+LC_ALL=C \
+PATH=/bin \
+HOME=/tmp/lib/domain--1-QEMUGuest1 \
+USER=test \
+LOGNAME=test \
+XDG_DATA_HOME=/tmp/lib/domain--1-QEMUGuest1/.local/share \
+XDG_CACHE_HOME=/tmp/lib/domain--1-QEMUGuest1/.cache \
+XDG_CONFIG_HOME=/tmp/lib/domain--1-QEMUGuest1/.config \
+QEMU_AUDIO_DRV=none \
+/usr/bin/qemu-system-i386 \
+-name QEMUGuest1 \
+-S \
+-machine pc,accel=tcg,usb=off,dump-guest-core=off \
+-m 214 \
+-realtime mlock=off \
+-smp 1,sockets=1,cores=1,threads=1 \
+-uuid c7a5fdbd-edaf-9455-926a-d65c16db1809 \
+-display none \
+-no-user-config \
+-nodefaults \
+-chardev socket,id=charmonitor,path=/tmp/lib/domain--1-QEMUGuest1/monitor.sock,server=on,wait=off \
+-mon chardev=charmonitor,id=monitor,mode=control \
+-rtc base=utc \
+-no-shutdown \
+-no-acpi \
+-usb \
+-drive file=/dev/HostVG/QEMUGuest1,format=raw,if=none,id=drive-ide0-0-0 \
+-device ide-hd,bus=ide.0,unit=0,drive=drive-ide0-0-0,id=ide0-0-0,bootindex=1 \
+-netdev user,id=hostnet0 \
+-device virtio-net-pci,page-per-vq=on,netdev=hostnet0,id=net0,mac=00:11:22:33:44:55,bus=pci.0,addr=0x3 \
+-device virtio-balloon-pci,id=balloon0,bus=pci.0,addr=0x4
diff --git a/tests/qemuxml2argvdata/net-virtio-page-per-vq.xml b/tests/qemuxml2argvdata/net-virtio-page-per-vq.xml
new file mode 100644
index 0000000..e22ecd6
--- /dev/null
+++ b/tests/qemuxml2argvdata/net-virtio-page-per-vq.xml
@@ -0,0 +1,29 @@
+<domain type='qemu'>
+  <name>QEMUGuest1</name>
+  <uuid>c7a5fdbd-edaf-9455-926a-d65c16db1809</uuid>
+  <memory unit='KiB'>219100</memory>
+  <currentMemory unit='KiB'>219100</currentMemory>
+  <vcpu placement='static'>1</vcpu>
+  <os>
+    <type arch='i686' machine='pc'>hvm</type>
+    <boot dev='hd'/>
+  </os>
+  <clock offset='utc'/>
+  <on_poweroff>destroy</on_poweroff>
+  <on_reboot>restart</on_reboot>
+  <on_crash>destroy</on_crash>
+  <devices>
+    <emulator>/usr/bin/qemu-system-i386</emulator>
+    <disk type='block' device='disk'>
+      <source dev='/dev/HostVG/QEMUGuest1'/>
+      <target dev='hda' bus='ide'/>
+    </disk>
+    <controller type='usb' index='0'/>
+    <interface type='user'>
+      <mac address='00:11:22:33:44:55'/>
+      <model type='virtio'/>
+      <driver page_per_vq='on'/>
+    </interface>
+    <memballoon model='virtio'/>
+  </devices>
+</domain>
diff --git a/tests/qemuxml2argvtest.c b/tests/qemuxml2argvtest.c
index f0efe98..f6b8f34 100644
--- a/tests/qemuxml2argvtest.c
+++ b/tests/qemuxml2argvtest.c
@@ -1654,6 +1654,8 @@ mymain(void)
             QEMU_CAPS_VIRTIO_NET_TX_QUEUE_SIZE);
     DO_TEST_PARSE_ERROR("net-virtio-rxqueuesize-invalid-size",
                         QEMU_CAPS_VIRTIO_NET_RX_QUEUE_SIZE);
+    DO_TEST("net-virtio-page-per-vq",
+            QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ);
     DO_TEST("net-virtio-teaming",
             QEMU_CAPS_VIRTIO_NET_FAILOVER,
             QEMU_CAPS_DEVICE_VFIO_PCI);
diff --git a/tests/qemuxml2xmloutdata/net-virtio-page-per-vq.xml b/tests/qemuxml2xmloutdata/net-virtio-page-per-vq.xml
new file mode 100644
index 0000000..a35ed8a
--- /dev/null
+++ b/tests/qemuxml2xmloutdata/net-virtio-page-per-vq.xml
@@ -0,0 +1,43 @@
+<domain type='qemu'>
+  <name>QEMUGuest1</name>
+  <uuid>c7a5fdbd-edaf-9455-926a-d65c16db1809</uuid>
+  <memory unit='KiB'>219100</memory>
+  <currentMemory unit='KiB'>219100</currentMemory>
+  <vcpu placement='static'>1</vcpu>
+  <os>
+    <type arch='i686' machine='pc'>hvm</type>
+    <boot dev='hd'/>
+  </os>
+  <clock offset='utc'/>
+  <on_poweroff>destroy</on_poweroff>
+  <on_reboot>restart</on_reboot>
+  <on_crash>destroy</on_crash>
+  <devices>
+    <emulator>/usr/bin/qemu-system-i386</emulator>
+    <disk type='block' device='disk'>
+      <driver name='qemu' type='raw'/>
+      <source dev='/dev/HostVG/QEMUGuest1'/>
+      <target dev='hda' bus='ide'/>
+      <address type='drive' controller='0' bus='0' target='0' unit='0'/>
+    </disk>
+    <controller type='usb' index='0'>
+      <address type='pci' domain='0x0000' bus='0x00' slot='0x01' function='0x2'/>
+    </controller>
+    <controller type='pci' index='0' model='pci-root'/>
+    <controller type='ide' index='0'>
+      <address type='pci' domain='0x0000' bus='0x00' slot='0x01' function='0x1'/>
+    </controller>
+    <interface type='user'>
+      <mac address='00:11:22:33:44:55'/>
+      <model type='virtio'/>
+      <driver page_per_vq='on'/>
+      <address type='pci' domain='0x0000' bus='0x00' slot='0x03' function='0x0'/>
+    </interface>
+    <input type='mouse' bus='ps2'/>
+    <input type='keyboard' bus='ps2'/>
+    <audio id='1' type='none'/>
+    <memballoon model='virtio'>
+      <address type='pci' domain='0x0000' bus='0x00' slot='0x04' function='0x0'/>
+    </memballoon>
+  </devices>
+</domain>
diff --git a/tests/qemuxml2xmltest.c b/tests/qemuxml2xmltest.c
index c37de0c..2bd6eea 100644
--- a/tests/qemuxml2xmltest.c
+++ b/tests/qemuxml2xmltest.c
@@ -450,6 +450,8 @@ mymain(void)
     DO_TEST("net-virtio-rxtxqueuesize",
             QEMU_CAPS_VIRTIO_NET_RX_QUEUE_SIZE,
             QEMU_CAPS_VIRTIO_NET_TX_QUEUE_SIZE);
+    DO_TEST("net-virtio-page-per-vq",
+            QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ);
     DO_TEST("net-virtio-teaming",
             QEMU_CAPS_VIRTIO_NET_FAILOVER,
             QEMU_CAPS_DEVICE_VFIO_PCI);
-- 
1.8.3.1

Re: [PATCH] Add page_per_vq flag to the 'driver' element of virtio devices
Posted by Jonathon Jongsma 2 years, 11 months ago
On Thu, 2021-04-29 at 14:12 +0300, Gavi Teitz wrote:
> https://bugzilla.redhat.com/show_bug.cgi?id=1925363
> 
> Add support for setting the page-per-vq flag, which is important for
> vdpa with vhost-user performance.
> 
> Signed-off-by: Gavi Teitz <gavi@nvidia.com>
> 

[snip]

> diff --git a/src/conf/domain_conf.c b/src/conf/domain_conf.c
> index 9d98f48..0350fde 100644
> --- a/src/conf/domain_conf.c
> +++ b/src/conf/domain_conf.c
> @@ -10446,6 +10446,7 @@ virDomainNetDefParseXML(virDomainXMLOption
> *xmlopt,
>      g_autofree char *queues = NULL;
>      g_autofree char *rx_queue_size = NULL;
>      g_autofree char *tx_queue_size = NULL;
> +    g_autofree char *page_per_vq = NULL;
>      g_autofree char *filter = NULL;
>      g_autofree char *internal = NULL;
>      g_autofree char *mode = NULL;
> @@ -10615,6 +10616,7 @@ virDomainNetDefParseXML(virDomainXMLOption
> *xmlopt,
>                  queues = virXMLPropString(cur, "queues");
>                  rx_queue_size = virXMLPropString(cur,
> "rx_queue_size");
>                  tx_queue_size = virXMLPropString(cur,
> "tx_queue_size");
> +                page_per_vq = virXMLPropString(cur, "page_per_vq");
>  
>                  if (virDomainVirtioOptionsParseXML(cur, &def-
> >virtio) < 0)
>                      goto error;
> @@ -11041,6 +11043,15 @@ virDomainNetDefParseXML(virDomainXMLOption
> *xmlopt,
>              }
>              def->driver.virtio.tx_queue_size = q;
>          }
> +        if (page_per_vq) {
> +            if ((val = virTristateSwitchTypeFromString(page_per_vq))
> <= 0) {
> +                virReportError(VIR_ERR_CONFIG_UNSUPPORTED,
> +                               _("unknown page_per_vq mode '%s'"),
> +                               page_per_vq);
> +                goto error;
> +            }
> +            def->driver.virtio.page_per_vq = val;
> +        }

Tim Wiederhake has been doing a bunch of refactoring work in this area
changing code like this to use the newish virXMLProp* functions. 

So I think this whole chunk could instead be something like:

if (virXMLPropTristateSwitch(cur, "page_per_vq", VIR_XML_PROP_NONE, 
			     &def->driver.virtio.page_per_vq) < 0)
    return -1;


(untested)

Jonathon

Re: [PATCH] Add page_per_vq flag to the 'driver' element of virtio devices
Posted by Gavi Teitz 2 years, 11 months ago
On 4/29/21 5:43 PM, Jonathon Jongsma wrote:
> On Thu, 2021-04-29 at 14:12 +0300, Gavi Teitz wrote:
>> https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fbugzilla.redhat.com%2Fshow_bug.cgi%3Fid%3D1925363&amp;data=04%7C01%7Cgavi%40nvidia.com%7C3a0a9dc902bd4ddb90fa08d90b1d3ac9%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C637553042452946162%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=0xd9ZcmXQJPT%2FeqT%2Fs%2B0KOc5HnMN447pE6r7V6bc%2B98%3D&amp;reserved=0
>>
>> Add support for setting the page-per-vq flag, which is important for
>> vdpa with vhost-user performance.
>>
>> Signed-off-by: Gavi Teitz <gavi@nvidia.com>
>>
> [snip]
>
>> diff --git a/src/conf/domain_conf.c b/src/conf/domain_conf.c
>> index 9d98f48..0350fde 100644
>> --- a/src/conf/domain_conf.c
>> +++ b/src/conf/domain_conf.c
>> @@ -10446,6 +10446,7 @@ virDomainNetDefParseXML(virDomainXMLOption
>> *xmlopt,
>>       g_autofree char *queues = NULL;
>>       g_autofree char *rx_queue_size = NULL;
>>       g_autofree char *tx_queue_size = NULL;
>> +    g_autofree char *page_per_vq = NULL;
>>       g_autofree char *filter = NULL;
>>       g_autofree char *internal = NULL;
>>       g_autofree char *mode = NULL;
>> @@ -10615,6 +10616,7 @@ virDomainNetDefParseXML(virDomainXMLOption
>> *xmlopt,
>>                   queues = virXMLPropString(cur, "queues");
>>                   rx_queue_size = virXMLPropString(cur,
>> "rx_queue_size");
>>                   tx_queue_size = virXMLPropString(cur,
>> "tx_queue_size");
>> +                page_per_vq = virXMLPropString(cur, "page_per_vq");
>>   
>>                   if (virDomainVirtioOptionsParseXML(cur, &def-
>>> virtio) < 0)
>>                       goto error;
>> @@ -11041,6 +11043,15 @@ virDomainNetDefParseXML(virDomainXMLOption
>> *xmlopt,
>>               }
>>               def->driver.virtio.tx_queue_size = q;
>>           }
>> +        if (page_per_vq) {
>> +            if ((val = virTristateSwitchTypeFromString(page_per_vq))
>> <= 0) {
>> +                virReportError(VIR_ERR_CONFIG_UNSUPPORTED,
>> +                               _("unknown page_per_vq mode '%s'"),
>> +                               page_per_vq);
>> +                goto error;
>> +            }
>> +            def->driver.virtio.page_per_vq = val;
>> +        }
> Tim Wiederhake has been doing a bunch of refactoring work in this area
> changing code like this to use the newish virXMLProp* functions.
>
> So I think this whole chunk could instead be something like:
>
> if (virXMLPropTristateSwitch(cur, "page_per_vq", VIR_XML_PROP_NONE,
> 			     &def->driver.virtio.page_per_vq) < 0)
>      return -1;
>
>
> (untested)
>
> Jonathon
>
Assigning def->driver.virtio.page_per_vq in the initial parsing loop 
with the suggested lines would prevent the assignment from being 
conditioned upon virDomainNetIsVirtioModel() as it currently is.

Is it not necessary to enforce this condition, or should I instead do 
the assignment as I currently do, after the parsing loop and under the 
condition, and use virXPathNode("./driver", ctxt) to get the node?


Re: [PATCH] Add page_per_vq flag to the 'driver' element of virtio devices
Posted by Peter Krempa 2 years, 11 months ago
On Thu, Apr 29, 2021 at 14:12:44 +0300, Gavi Teitz wrote:
> https://bugzilla.redhat.com/show_bug.cgi?id=1925363
> 
> Add support for setting the page-per-vq flag, which is important for
> vdpa with vhost-user performance.
> 
> Signed-off-by: Gavi Teitz <gavi@nvidia.com>
> ---
>  docs/formatdomain.rst                              | 11 +++++-
>  docs/schemas/domaincommon.rng                      |  5 +++
>  src/conf/domain_conf.c                             | 15 ++++++++
>  src/conf/domain_conf.h                             |  1 +
>  src/qemu/qemu_capabilities.c                       |  2 +
>  src/qemu/qemu_capabilities.h                       |  1 +
>  src/qemu/qemu_command.c                            |  5 +++
>  src/qemu/qemu_hotplug.c                            |  1 +
>  tests/qemuxml2argvdata/net-virtio-page-per-vq.args | 31 ++++++++++++++++
>  tests/qemuxml2argvdata/net-virtio-page-per-vq.xml  | 29 +++++++++++++++
>  tests/qemuxml2argvtest.c                           |  2 +
>  .../qemuxml2xmloutdata/net-virtio-page-per-vq.xml  | 43 ++++++++++++++++++++++
>  tests/qemuxml2xmltest.c                            |  2 +
>  13 files changed, 147 insertions(+), 1 deletion(-)
>  create mode 100644 tests/qemuxml2argvdata/net-virtio-page-per-vq.args
>  create mode 100644 tests/qemuxml2argvdata/net-virtio-page-per-vq.xml
>  create mode 100644 tests/qemuxml2xmloutdata/net-virtio-page-per-vq.xml
> 
> diff --git a/docs/formatdomain.rst b/docs/formatdomain.rst
> index 282176c..f5a9c70 100644
> --- a/docs/formatdomain.rst
> +++ b/docs/formatdomain.rst
> @@ -5120,7 +5120,7 @@ Setting NIC driver-specific options
>         <source network='default'/>
>         <target dev='vnet1'/>
>         <model type='virtio'/>
> -       <driver name='vhost' txmode='iothread' ioeventfd='on' event_idx='off' queues='5' rx_queue_size='256' tx_queue_size='256'>
> +       <driver name='vhost' txmode='iothread' ioeventfd='on' event_idx='off' queues='5' rx_queue_size='256' tx_queue_size='256' page_per_vq='on'>
>           <host csum='off' gso='off' tso4='off' tso6='off' ecn='off' ufo='off' mrg_rxbuf='off'/>
>           <guest csum='off' tso4='off' tso6='off' ecn='off' ufo='off'/>
>         </driver>
> @@ -5215,6 +5215,15 @@ following attributes are available for the ``"virtio"`` NIC driver:
>     only for ``vhostuser`` type. :since:`Since 3.7.0 (QEMU and KVM only)`
>     **In general you should leave this option alone, unless you are very certain
>     you know what you are doing.**
> +``page_per_vq``
> +   This optional attribute controls the layout of the notification capabilities
> +   exposed to the guest. When enabled, each virtio queue will have a dedicated
> +   page on the device BAR exposed to the guest. It is recommended to be used when
> +   vDPA is enabled on the hypervisor, as it enables mapping the notification area
> +   to the physical device, which is only supported in page granularity. The
> +   default is determined by QEMU; as off. :since:`Since 2.8.0 (QEMU only)`

The 'since' tag is primarily used for the libvirt version which
introduces the feature thus you must include that.

Also note that starting from the next release the minimum supported qemu
version will be bumped to qemu-2.10 thus the qemu version note is not
needed.


> +   **In general you should leave this option alone, unless you are very certain
> +   you know what you are doing.**
>  virtio options
>     For virtio interfaces, `Virtio-specific options <#elementsVirtio>`__ can also
>     be set. ( :since:`Since 3.5.0` )
> diff --git a/docs/schemas/domaincommon.rng b/docs/schemas/domaincommon.rng
> index a2e5c50..e61ad67 100644
> --- a/docs/schemas/domaincommon.rng
> +++ b/docs/schemas/domaincommon.rng
> @@ -3463,6 +3463,11 @@
>                  </attribute>
>                </optional>
>                <optional>
> +                <attribute name="page_per_vq">
> +                  <ref name="virOnOff"/>
> +                </attribute>
> +              </optional>
> +              <optional>
>                  <attribute name="txmode">
>                    <choice>
>                      <value>iothread</value>

[...]

> @@ -1405,6 +1406,7 @@ static struct virQEMUCapsDevicePropsFlags virQEMUCapsDevicePropsVirtioNet[] = {
>      { "event_idx", QEMU_CAPS_VIRTIO_NET_EVENT_IDX, NULL },
>      { "rx_queue_size", QEMU_CAPS_VIRTIO_NET_RX_QUEUE_SIZE, NULL },
>      { "tx_queue_size", QEMU_CAPS_VIRTIO_NET_TX_QUEUE_SIZE, NULL },
> +    { "page_per_vq", QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ, NULL },
>      { "host_mtu", QEMU_CAPS_VIRTIO_NET_HOST_MTU, NULL },
>      { "disable-legacy", QEMU_CAPS_VIRTIO_PCI_DISABLE_LEGACY, NULL },
>      { "iommu_platform", QEMU_CAPS_VIRTIO_PCI_IOMMU_PLATFORM, NULL },
> diff --git a/src/qemu/qemu_capabilities.h b/src/qemu/qemu_capabilities.h
> index f54aad5..75feed4 100644
> --- a/src/qemu/qemu_capabilities.h
> +++ b/src/qemu/qemu_capabilities.h
> @@ -609,6 +609,7 @@ typedef enum { /* virQEMUCapsFlags grouping marker for syntax-check */
>      /* 400 */
>      QEMU_CAPS_COMPAT_DEPRECATED, /* -compat deprecated-(input|output) is supported */
>      QEMU_CAPS_ACPI_INDEX, /* PCI device 'acpi-index' property */
> +    QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ, /* virtio-net-*.page_per_vq */
>  
>      QEMU_CAPS_LAST /* this must always be the last item */
>  } virQEMUCapsFlags;

1) qemu capabilities changes are usually separated into a separate commit
2) missing update to the capability data, either this doesn't detect
   anything or tests would be broken.
3) the capability is not necessary since all qemus supported since
   libvirt-7.4 will have it

> diff --git a/src/qemu/qemu_command.c b/src/qemu/qemu_command.c
> index d7f1c71..7b5834f 100644
> --- a/src/qemu/qemu_command.c
> +++ b/src/qemu/qemu_command.c
> @@ -3630,6 +3630,11 @@ qemuBuildNicDevStr(virDomainDef *def,
>          if (net->driver.virtio.tx_queue_size)
>              virBufferAsprintf(&buf, ",tx_queue_size=%u", net->driver.virtio.tx_queue_size);
>  
> +        if (net->driver.virtio.page_per_vq) {
> +            virBufferAsprintf(&buf, ",page-per-vq=%s",
> +                              virTristateSwitchTypeToString(net->driver.virtio.page_per_vq));
> +        }
> +
>          if (net->mtu)
>              virBufferAsprintf(&buf, ",host_mtu=%u", net->mtu);
>  

[...]

> diff --git a/tests/qemuxml2argvtest.c b/tests/qemuxml2argvtest.c
> index f0efe98..f6b8f34 100644
> --- a/tests/qemuxml2argvtest.c
> +++ b/tests/qemuxml2argvtest.c
> @@ -1654,6 +1654,8 @@ mymain(void)
>              QEMU_CAPS_VIRTIO_NET_TX_QUEUE_SIZE);
>      DO_TEST_PARSE_ERROR("net-virtio-rxqueuesize-invalid-size",
>                          QEMU_CAPS_VIRTIO_NET_RX_QUEUE_SIZE);
> +    DO_TEST("net-virtio-page-per-vq",
> +            QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ);

New positive tests should not use DO_TEST, but DO_TEST_CAPS_LATEST.

>      DO_TEST("net-virtio-teaming",
>              QEMU_CAPS_VIRTIO_NET_FAILOVER,
>              QEMU_CAPS_DEVICE_VFIO_PCI);

[...]

According to the docs this is also a guest-visible change, thus we are
missing a check in the ABI stability checker code which ensures that the
guest ABI doesn't change.

See virDomainDefCheckABIStabilityFlags to find the appropriate place.

Re: [PATCH] Add page_per_vq flag to the 'driver' element of virtio devices
Posted by Gavi Teitz 2 years, 11 months ago
On 4/29/21 2:51 PM, Peter Krempa wrote:

On Thu, Apr 29, 2021 at 14:12:44 +0300, Gavi Teitz wrote:

https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fbugzilla.redhat.com%2Fshow_bug.cgi%3Fid%3D1925363&amp;data=04%7C01%7Cgavi%40nvidia.com%7Cb136902485fb47976af208d90b05314f%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C637552939196391048%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=VYgjIiG3yeT56iOHFLqQr%2F22EjGTfc743pcKHH1B18E%3D&amp;reserved=0



Add support for setting the page-per-vq flag, which is important for

vdpa with vhost-user performance.



Signed-off-by: Gavi Teitz <gavi@nvidia.com><mailto:gavi@nvidia.com>

---

 docs/formatdomain.rst                              | 11 +++++-

 docs/schemas/domaincommon.rng                      |  5 +++

 src/conf/domain_conf.c                             | 15 ++++++++

 src/conf/domain_conf.h                             |  1 +

 src/qemu/qemu_capabilities.c                       |  2 +

 src/qemu/qemu_capabilities.h                       |  1 +

 src/qemu/qemu_command.c                            |  5 +++

 src/qemu/qemu_hotplug.c                            |  1 +

 tests/qemuxml2argvdata/net-virtio-page-per-vq.args | 31 ++++++++++++++++

 tests/qemuxml2argvdata/net-virtio-page-per-vq.xml  | 29 +++++++++++++++

 tests/qemuxml2argvtest.c                           |  2 +

 .../qemuxml2xmloutdata/net-virtio-page-per-vq.xml  | 43 ++++++++++++++++++++++

 tests/qemuxml2xmltest.c                            |  2 +

 13 files changed, 147 insertions(+), 1 deletion(-)

 create mode 100644 tests/qemuxml2argvdata/net-virtio-page-per-vq.args

 create mode 100644 tests/qemuxml2argvdata/net-virtio-page-per-vq.xml

 create mode 100644 tests/qemuxml2xmloutdata/net-virtio-page-per-vq.xml



diff --git a/docs/formatdomain.rst b/docs/formatdomain.rst

index 282176c..f5a9c70 100644

--- a/docs/formatdomain.rst

+++ b/docs/formatdomain.rst

@@ -5120,7 +5120,7 @@ Setting NIC driver-specific options

        <source network='default'/>

        <target dev='vnet1'/>

        <model type='virtio'/>

-       <driver name='vhost' txmode='iothread' ioeventfd='on' event_idx='off' queues='5' rx_queue_size='256' tx_queue_size='256'>

+       <driver name='vhost' txmode='iothread' ioeventfd='on' event_idx='off' queues='5' rx_queue_size='256' tx_queue_size='256' page_per_vq='on'>

          <host csum='off' gso='off' tso4='off' tso6='off' ecn='off' ufo='off' mrg_rxbuf='off'/>

          <guest csum='off' tso4='off' tso6='off' ecn='off' ufo='off'/>

        </driver>

@@ -5215,6 +5215,15 @@ following attributes are available for the ``"virtio"`` NIC driver:

    only for ``vhostuser`` type. :since:`Since 3.7.0 (QEMU and KVM only)`

    **In general you should leave this option alone, unless you are very certain

    you know what you are doing.**

+``page_per_vq``

+   This optional attribute controls the layout of the notification capabilities

+   exposed to the guest. When enabled, each virtio queue will have a dedicated

+   page on the device BAR exposed to the guest. It is recommended to be used when

+   vDPA is enabled on the hypervisor, as it enables mapping the notification area

+   to the physical device, which is only supported in page granularity. The

+   default is determined by QEMU; as off. :since:`Since 2.8.0 (QEMU only)`



The 'since' tag is primarily used for the libvirt version which

introduces the feature thus you must include that.



Also note that starting from the next release the minimum supported qemu

version will be bumped to qemu-2.10 thus the qemu version note is not

needed.


It isn't really clear what the 'since' tag refers to, if I understand your comment correctly it refers to the libvirt version, so I will change it to 7.4.0 and remove the '(QEMU only)' part.




+   **In general you should leave this option alone, unless you are very certain

+   you know what you are doing.**

 virtio options

    For virtio interfaces, `Virtio-specific options <#elementsVirtio>`__ can also

    be set. ( :since:`Since 3.5.0` )

diff --git a/docs/schemas/domaincommon.rng b/docs/schemas/domaincommon.rng

index a2e5c50..e61ad67 100644

--- a/docs/schemas/domaincommon.rng

+++ b/docs/schemas/domaincommon.rng

@@ -3463,6 +3463,11 @@

                 </attribute>

               </optional>

               <optional>

+                <attribute name="page_per_vq">

+                  <ref name="virOnOff"/>

+                </attribute>

+              </optional>

+              <optional>

                 <attribute name="txmode">

                   <choice>

                     <value>iothread</value>



[...]



@@ -1405,6 +1406,7 @@ static struct virQEMUCapsDevicePropsFlags virQEMUCapsDevicePropsVirtioNet[] = {

     { "event_idx", QEMU_CAPS_VIRTIO_NET_EVENT_IDX, NULL },

     { "rx_queue_size", QEMU_CAPS_VIRTIO_NET_RX_QUEUE_SIZE, NULL },

     { "tx_queue_size", QEMU_CAPS_VIRTIO_NET_TX_QUEUE_SIZE, NULL },

+    { "page_per_vq", QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ, NULL },

     { "host_mtu", QEMU_CAPS_VIRTIO_NET_HOST_MTU, NULL },

     { "disable-legacy", QEMU_CAPS_VIRTIO_PCI_DISABLE_LEGACY, NULL },

     { "iommu_platform", QEMU_CAPS_VIRTIO_PCI_IOMMU_PLATFORM, NULL },

diff --git a/src/qemu/qemu_capabilities.h b/src/qemu/qemu_capabilities.h

index f54aad5..75feed4 100644

--- a/src/qemu/qemu_capabilities.h

+++ b/src/qemu/qemu_capabilities.h

@@ -609,6 +609,7 @@ typedef enum { /* virQEMUCapsFlags grouping marker for syntax-check */

     /* 400 */

     QEMU_CAPS_COMPAT_DEPRECATED, /* -compat deprecated-(input|output) is supported */

     QEMU_CAPS_ACPI_INDEX, /* PCI device 'acpi-index' property */

+    QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ, /* virtio-net-*.page_per_vq */



     QEMU_CAPS_LAST /* this must always be the last item */

 } virQEMUCapsFlags;



1) qemu capabilities changes are usually separated into a separate commit

2) missing update to the capability data, either this doesn't detect

   anything or tests would be broken.

3) the capability is not necessary since all qemus supported since

   libvirt-7.4 will have it
Since this is not necessary I will remove the changes to qemu_capabilities*.



diff --git a/src/qemu/qemu_command.c b/src/qemu/qemu_command.c

index d7f1c71..7b5834f 100644

--- a/src/qemu/qemu_command.c

+++ b/src/qemu/qemu_command.c

@@ -3630,6 +3630,11 @@ qemuBuildNicDevStr(virDomainDef *def,

         if (net->driver.virtio.tx_queue_size)

             virBufferAsprintf(&buf, ",tx_queue_size=%u", net->driver.virtio.tx_queue_size);



+        if (net->driver.virtio.page_per_vq) {

+            virBufferAsprintf(&buf, ",page-per-vq=%s",

+                              virTristateSwitchTypeToString(net->driver.virtio.page_per_vq));

+        }

+

         if (net->mtu)

             virBufferAsprintf(&buf, ",host_mtu=%u", net->mtu);





[...]



diff --git a/tests/qemuxml2argvtest.c b/tests/qemuxml2argvtest.c

index f0efe98..f6b8f34 100644

--- a/tests/qemuxml2argvtest.c

+++ b/tests/qemuxml2argvtest.c

@@ -1654,6 +1654,8 @@ mymain(void)

             QEMU_CAPS_VIRTIO_NET_TX_QUEUE_SIZE);

     DO_TEST_PARSE_ERROR("net-virtio-rxqueuesize-invalid-size",

                         QEMU_CAPS_VIRTIO_NET_RX_QUEUE_SIZE);

+    DO_TEST("net-virtio-page-per-vq",

+            QEMU_CAPS_VIRTIO_NET_PAGE_PER_VQ);



New positive tests should not use DO_TEST, but DO_TEST_CAPS_LATEST.
Ack.




     DO_TEST("net-virtio-teaming",

             QEMU_CAPS_VIRTIO_NET_FAILOVER,

             QEMU_CAPS_DEVICE_VFIO_PCI);



[...]



According to the docs this is also a guest-visible change, thus we are

missing a check in the ABI stability checker code which ensures that the

guest ABI doesn't change.



See virDomainDefCheckABIStabilityFlags to find the appropriate place.


I couldn't find a related function for comparing driver options for virDomainNetDef (I expected to find something in virDomainNetDefCheckABIStability(), but didn't), please clarify what needs to be added here.