From: Denis Plotnikov
To: qemu-devel@nongnu.org
Cc: yc-core@yandex-team.ru, mst@redhat.com
Subject: [PATCH v3] vhost: make SET_VRING_ADDR, SET_FEATURES send replies
Date: Mon, 9 Aug 2021 12:03:30 +0300
Message-Id: <20210809090330.86304-1-den-plotnikov@yandex-team.ru>

On vhost-user-blk migration, qemu normally sends a number of commands
to enable logging if VHOST_USER_PROTOCOL_F_LOG_SHMFD is negotiated.
Qemu sends VHOST_USER_SET_FEATURES to enable buffer logging and
VHOST_USER_SET_VRING_ADDR for each started ring to enable "used ring"
data logging.
The issue is that qemu doesn't wait for a reply from the vhost daemon
for these commands, which may result in a race between qemu's expectation
that logging has started and logging actually starting in the vhost daemon.

The race can appear as follows: on migration setup, qemu enables dirty page
logging by sending VHOST_USER_SET_FEATURES. The command doesn't reach the
vhost-user-blk daemon immediately, and the daemon needs some time to turn
logging on internally. If qemu doesn't wait for a reply after sending the
command, it may start migrating memory pages to the destination. At this
time, logging may not actually be turned on in the daemon, but some guest
pages which the daemon is about to write to may already have been
transferred to the destination without logging. Since logging wasn't turned
on, those pages won't be transferred again as dirty.
So we may end up with corrupted data on the destination.
The same scenario applies to "used ring" data logging, which is
turned on with the VHOST_USER_SET_VRING_ADDR command.

To resolve this issue, this patch makes qemu wait for the commands' result
explicitly if VHOST_USER_PROTOCOL_F_REPLY_ACK is negotiated and logging is
enabled.

Signed-off-by: Denis Plotnikov
---
v2 -> v3:
  * send VHOST_USER_GET_FEATURES to flush out outstanding messages [mst]

v1 -> v2:
  * send reply only when logging is enabled [mst]

v0 -> v1:
  * send reply for SET_VRING_ADDR, SET_FEATURES only [mst]
---
 hw/virtio/vhost-user.c | 130 ++++++++++++++++++++++++++++-------------
 1 file changed, 89 insertions(+), 41 deletions(-)

diff --git a/hw/virtio/vhost-user.c b/hw/virtio/vhost-user.c
index ee57abe04526..18f685df549f 100644
--- a/hw/virtio/vhost-user.c
+++ b/hw/virtio/vhost-user.c
@@ -1095,23 +1095,6 @@ static int vhost_user_set_mem_table(struct vhost_dev *dev,
     return 0;
 }
 
-static int vhost_user_set_vring_addr(struct vhost_dev *dev,
-                                     struct vhost_vring_addr *addr)
-{
-    VhostUserMsg msg = {
-        .hdr.request = VHOST_USER_SET_VRING_ADDR,
-        .hdr.flags = VHOST_USER_VERSION,
-        .payload.addr = *addr,
-        .hdr.size = sizeof(msg.payload.addr),
-    };
-
-    if (vhost_user_write(dev, &msg, NULL, 0) < 0) {
-        return -1;
-    }
-
-    return 0;
-}
-
 static int vhost_user_set_vring_endian(struct vhost_dev *dev,
                                        struct vhost_vring_state *ring)
 {
@@ -1288,72 +1271,137 @@ static int vhost_user_set_vring_call(struct vhost_dev *dev,
     return vhost_set_vring_file(dev, VHOST_USER_SET_VRING_CALL, file);
 }
 
-static int vhost_user_set_u64(struct vhost_dev *dev, int request, uint64_t u64)
+
+static int vhost_user_get_u64(struct vhost_dev *dev, int request, uint64_t *u64)
 {
     VhostUserMsg msg = {
         .hdr.request = request,
         .hdr.flags = VHOST_USER_VERSION,
-        .payload.u64 = u64,
-        .hdr.size = sizeof(msg.payload.u64),
     };
 
+    if (vhost_user_one_time_request(request) && dev->vq_index != 0) {
+        return 0;
+    }
+
     if (vhost_user_write(dev, &msg, NULL, 0) < 0) {
         return -1;
     }
 
+    if (vhost_user_read(dev, &msg) < 0) {
+        return -1;
+    }
+
+    if (msg.hdr.request != request) {
+        error_report("Received unexpected msg type. Expected %d received %d",
+                     request, msg.hdr.request);
+        return -1;
+    }
+
+    if (msg.hdr.size != sizeof(msg.payload.u64)) {
+        error_report("Received bad msg size.");
+        return -1;
+    }
+
+    *u64 = msg.payload.u64;
+
     return 0;
 }
 
-static int vhost_user_set_features(struct vhost_dev *dev,
-                                   uint64_t features)
+static int vhost_user_get_features(struct vhost_dev *dev, uint64_t *features)
 {
-    return vhost_user_set_u64(dev, VHOST_USER_SET_FEATURES, features);
+    return vhost_user_get_u64(dev, VHOST_USER_GET_FEATURES, features);
 }
 
-static int vhost_user_set_protocol_features(struct vhost_dev *dev,
-                                            uint64_t features)
+static int enforce_reply(struct vhost_dev *dev)
 {
-    return vhost_user_set_u64(dev, VHOST_USER_SET_PROTOCOL_FEATURES, features);
+    /*
+     * we need a reply but can't get it from some command directly,
+     * so send the command which must send a reply to make sure
+     * the command we sent before is actually completed.
+     */
+    uint64_t dummy;
+    return vhost_user_get_features(dev, &dummy);
 }
 
-static int vhost_user_get_u64(struct vhost_dev *dev, int request, uint64_t *u64)
+static int vhost_user_set_vring_addr(struct vhost_dev *dev,
+                                     struct vhost_vring_addr *addr)
 {
     VhostUserMsg msg = {
-        .hdr.request = request,
+        .hdr.request = VHOST_USER_SET_VRING_ADDR,
         .hdr.flags = VHOST_USER_VERSION,
+        .payload.addr = *addr,
+        .hdr.size = sizeof(msg.payload.addr),
     };
 
-    if (vhost_user_one_time_request(request) && dev->vq_index != 0) {
-        return 0;
+    bool reply_supported = virtio_has_feature(dev->protocol_features,
+                                              VHOST_USER_PROTOCOL_F_REPLY_ACK);
+
+    /* we need a reply anyway if logging is enabled */
+    bool need_reply = !!(addr->flags & (1 << VHOST_VRING_F_LOG));
+
+    if (reply_supported && need_reply) {
+        msg.hdr.flags |= VHOST_USER_NEED_REPLY_MASK;
     }
 
     if (vhost_user_write(dev, &msg, NULL, 0) < 0) {
         return -1;
     }
 
-    if (vhost_user_read(dev, &msg) < 0) {
-        return -1;
+    if (msg.hdr.flags & VHOST_USER_NEED_REPLY_MASK) {
+        return process_message_reply(dev, &msg);
+    } else if (need_reply) {
+        return enforce_reply(dev);
     }
 
-    if (msg.hdr.request != request) {
-        error_report("Received unexpected msg type. Expected %d received %d",
-                     request, msg.hdr.request);
-        return -1;
+    return 0;
+}
+
+static int vhost_user_set_u64(struct vhost_dev *dev, int request, uint64_t u64,
+                              bool need_reply)
+{
+    VhostUserMsg msg = {
+        .hdr.request = request,
+        .hdr.flags = VHOST_USER_VERSION,
+        .payload.u64 = u64,
+        .hdr.size = sizeof(msg.payload.u64),
+    };
+
+    if (need_reply) {
+        bool reply_supported = virtio_has_feature(dev->protocol_features,
+                                          VHOST_USER_PROTOCOL_F_REPLY_ACK);
+        if (reply_supported) {
+            msg.hdr.flags |= VHOST_USER_NEED_REPLY_MASK;
+        }
     }
 
-    if (msg.hdr.size != sizeof(msg.payload.u64)) {
-        error_report("Received bad msg size.");
+    if (vhost_user_write(dev, &msg, NULL, 0) < 0) {
         return -1;
     }
 
-    *u64 = msg.payload.u64;
+    if (msg.hdr.flags & VHOST_USER_NEED_REPLY_MASK) {
+        return process_message_reply(dev, &msg);
+    } else if (need_reply) {
+        return enforce_reply(dev);
+    }
 
     return 0;
 }
 
-static int vhost_user_get_features(struct vhost_dev *dev, uint64_t *features)
+static int vhost_user_set_features(struct vhost_dev *dev,
+                                   uint64_t features)
 {
-    return vhost_user_get_u64(dev, VHOST_USER_GET_FEATURES, features);
+    /* we need a reply anyway if logging is enabled */
+    bool log_enabled = !!(features & (0x1ULL << VHOST_F_LOG_ALL));
+
+    return vhost_user_set_u64(dev, VHOST_USER_SET_FEATURES, features,
+                              log_enabled);
+}
+
+static int vhost_user_set_protocol_features(struct vhost_dev *dev,
+                                            uint64_t features)
+{
+    return vhost_user_set_u64(dev, VHOST_USER_SET_PROTOCOL_FEATURES, features,
+                              false);
 }
 
 static int vhost_user_set_owner(struct vhost_dev *dev)
-- 
2.25.1
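
Note for readers following the control flow: after this patch, both
vhost_user_set_vring_addr() and vhost_user_set_u64() make the same
three-way choice once the message is built. The standalone sketch below
models only that choice. It is an illustration, not part of the patch:
the enum, the function names and the SKETCH_* constants are hypothetical,
and the bit numbers are assumed to match VHOST_F_LOG_ALL and
VHOST_USER_PROTOCOL_F_REPLY_ACK in the real headers.

```c
#include <assert.h>   /* only needed by callers that check the results */
#include <stdbool.h>
#include <stdint.h>

/* Assumed bit numbers; check the real headers before relying on them. */
#define SKETCH_F_LOG_ALL            26 /* assumed value of VHOST_F_LOG_ALL */
#define SKETCH_PROTOCOL_F_REPLY_ACK 3  /* assumed value of ..._F_REPLY_ACK */

/* Hypothetical labels for the three outcomes in the patched functions. */
enum sync_mode {
    SYNC_NONE,      /* logging not requested: send and return */
    SYNC_REPLY_ACK, /* set NEED_REPLY and wait for the daemon's ack */
    SYNC_FLUSH,     /* no REPLY_ACK: follow up with GET_FEATURES */
};

/* Mirrors the if / else-if chain that follows vhost_user_write(). */
enum sync_mode choose_sync(bool reply_supported, bool need_reply)
{
    if (!need_reply) {
        return SYNC_NONE;
    }
    return reply_supported ? SYNC_REPLY_ACK : SYNC_FLUSH;
}

/* Models vhost_user_set_features(): a reply is needed only with logging. */
enum sync_mode sync_for_set_features(uint64_t features,
                                     uint64_t protocol_features)
{
    bool log_enabled = (features & (1ULL << SKETCH_F_LOG_ALL)) != 0;
    bool reply_supported =
        (protocol_features & (1ULL << SKETCH_PROTOCOL_F_REPLY_ACK)) != 0;

    return choose_sync(reply_supported, log_enabled);
}
```

With logging requested and REPLY_ACK negotiated, choose_sync() selects the
acknowledged path. Without REPLY_ACK it falls back to the GET_FEATURES
round trip, which serves as a flush on the assumption that the daemon
handles messages in the order they were sent.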