[PATCH v2] qdev: Report an error for machine without HotplugHandler

Akihiko Odaki posted 1 patch 11 months, 3 weeks ago
Patches applied successfully (tree, apply log)
git fetch https://github.com/patchew-project/qemu tags/patchew/20231210-bus-v2-1-34ebf5726fa0@daynix.com
Maintainers: Paolo Bonzini <pbonzini@redhat.com>, "Daniel P. Berrangé" <berrange@redhat.com>, Eduardo Habkost <eduardo@habkost.net>
system/qdev-monitor.c | 13 ++++++++++---
1 file changed, 10 insertions(+), 3 deletions(-)
[PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Akihiko Odaki 11 months, 3 weeks ago
The HotplugHandler of the machine will be used when the parent bus does
not exist, but the machine may not have one. Report an error in such a
case instead of aborting.

Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
---
Changes in v2:
- Fixed indention.
- Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
---
 system/qdev-monitor.c | 13 ++++++++++---
 1 file changed, 10 insertions(+), 3 deletions(-)

diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
index a13db763e5..5fe5d49c20 100644
--- a/system/qdev-monitor.c
+++ b/system/qdev-monitor.c
@@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
     qdev_hot_removed = true;
 
     hotplug_ctrl = qdev_get_hotplug_handler(dev);
-    /* hotpluggable device MUST have HotplugHandler, if it doesn't
-     * then something is very wrong with it */
-    g_assert(hotplug_ctrl);
+    if (!hotplug_ctrl) {
+        /*
+         * hotpluggable bus MUST have HotplugHandler, if it doesn't
+         * then something is very wrong with it
+         */
+        assert(!dev->parent_bus);
+
+        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
+        return;
+    }
 
     /* If device supports async unplug just request it to be done,
      * otherwise just remove it synchronously */

---
base-commit: 4705fc0c8511d073bee4751c3c974aab2b10a970
change-id: 20231202-bus-75a454c5d959

Best regards,
-- 
Akihiko Odaki <akihiko.odaki@daynix.com>
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Markus Armbruster 11 months, 3 weeks ago
Akihiko Odaki <akihiko.odaki@daynix.com> writes:

> The HotplugHandler of the machine will be used when the parent bus does
> not exist, but the machine may not have one. Report an error in such a
> case instead of aborting.
>
> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>

Do you have a reproducer for the crash?

> ---
> Changes in v2:
> - Fixed indention.
> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
> ---
>  system/qdev-monitor.c | 13 ++++++++++---
>  1 file changed, 10 insertions(+), 3 deletions(-)
>
> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
> index a13db763e5..5fe5d49c20 100644
> --- a/system/qdev-monitor.c
> +++ b/system/qdev-monitor.c
> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
   void qdev_unplug(DeviceState *dev, Error **errp)
   {
       DeviceClass *dc = DEVICE_GET_CLASS(dev);
       HotplugHandler *hotplug_ctrl;
       HotplugHandlerClass *hdc;
       Error *local_err = NULL;

       if (qdev_unplug_blocked(dev, errp)) {
           return;
       }

       if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
           error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
           return;
       }

       if (!dc->hotpluggable) {
           error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
                      object_get_typename(OBJECT(dev)));
           return;
       }

       if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
           error_setg(errp, "device_del not allowed while migrating");
           return;
       }

>      qdev_hot_removed = true;
>  
>      hotplug_ctrl = qdev_get_hotplug_handler(dev);
> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
> -     * then something is very wrong with it */
> -    g_assert(hotplug_ctrl);
> +    if (!hotplug_ctrl) {
> +        /*
> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
> +         * then something is very wrong with it
> +         */
> +        assert(!dev->parent_bus);
> +
> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
> +        return;
> +    }

Extended version of my question above: what are the devices where
qdev_get_hotplug_handler(dev) returns null here?

>  
>      /* If device supports async unplug just request it to be done,
>       * otherwise just remove it synchronously */
>
> ---
> base-commit: 4705fc0c8511d073bee4751c3c974aab2b10a970
> change-id: 20231202-bus-75a454c5d959
>
> Best regards,
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Akihiko Odaki 11 months, 2 weeks ago
On 2023/12/11 15:51, Markus Armbruster wrote:
> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
> 
>> The HotplugHandler of the machine will be used when the parent bus does
>> not exist, but the machine may not have one. Report an error in such a
>> case instead of aborting.
>>
>> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
> 
> Do you have a reproducer for the crash?
> 
>> ---
>> Changes in v2:
>> - Fixed indention.
>> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
>> ---
>>   system/qdev-monitor.c | 13 ++++++++++---
>>   1 file changed, 10 insertions(+), 3 deletions(-)
>>
>> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
>> index a13db763e5..5fe5d49c20 100644
>> --- a/system/qdev-monitor.c
>> +++ b/system/qdev-monitor.c
>> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
>     void qdev_unplug(DeviceState *dev, Error **errp)
>     {
>         DeviceClass *dc = DEVICE_GET_CLASS(dev);
>         HotplugHandler *hotplug_ctrl;
>         HotplugHandlerClass *hdc;
>         Error *local_err = NULL;
> 
>         if (qdev_unplug_blocked(dev, errp)) {
>             return;
>         }
> 
>         if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
>             error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
>             return;
>         }
> 
>         if (!dc->hotpluggable) {
>             error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
>                        object_get_typename(OBJECT(dev)));
>             return;
>         }
> 
>         if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
>             error_setg(errp, "device_del not allowed while migrating");
>             return;
>         }
> 
>>       qdev_hot_removed = true;
>>   
>>       hotplug_ctrl = qdev_get_hotplug_handler(dev);
>> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
>> -     * then something is very wrong with it */
>> -    g_assert(hotplug_ctrl);
>> +    if (!hotplug_ctrl) {
>> +        /*
>> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
>> +         * then something is very wrong with it
>> +         */
>> +        assert(!dev->parent_bus);
>> +
>> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
>> +        return;
>> +    }
> 
> Extended version of my question above: what are the devices where
> qdev_get_hotplug_handler(dev) returns null here?

Start a VM: qemu-system-aarch64 -M virt -nographic
Run the following on its HMP: device_del /machine/unattached/device[0]

It tries to unplug cortex-a15-arm-cpu and crashes.
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Markus Armbruster 11 months, 2 weeks ago
Akihiko Odaki <akihiko.odaki@daynix.com> writes:

> On 2023/12/11 15:51, Markus Armbruster wrote:
>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>> 
>>> The HotplugHandler of the machine will be used when the parent bus does
>>> not exist, but the machine may not have one. Report an error in such a
>>> case instead of aborting.
>>>
>>> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
>>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
>>
>> Do you have a reproducer for the crash?
>> 
>>> ---
>>> Changes in v2:
>>> - Fixed indention.
>>> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
>>> ---
>>>   system/qdev-monitor.c | 13 ++++++++++---
>>>   1 file changed, 10 insertions(+), 3 deletions(-)
>>>
>>> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
>>> index a13db763e5..5fe5d49c20 100644
>>> --- a/system/qdev-monitor.c
>>> +++ b/system/qdev-monitor.c
>>> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
>>   void qdev_unplug(DeviceState *dev, Error **errp)
>>   {
>>       DeviceClass *dc = DEVICE_GET_CLASS(dev);
>>       HotplugHandler *hotplug_ctrl;
>>       HotplugHandlerClass *hdc;
>>       Error *local_err = NULL;
>>       if (qdev_unplug_blocked(dev, errp)) {
>>           return;
>>       }
>>       if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
>>           error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
>>           return;
>>       }
>>       if (!dc->hotpluggable) {
>>           error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
>>                      object_get_typename(OBJECT(dev)));
>>           return;
>>       }
>>       if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
>>           error_setg(errp, "device_del not allowed while migrating");
>>           return;
>>       }
>> 
>>>      qdev_hot_removed = true;
>>>         hotplug_ctrl = qdev_get_hotplug_handler(dev);
>>> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
>>> -     * then something is very wrong with it */
>>> -    g_assert(hotplug_ctrl);
>>> +    if (!hotplug_ctrl) {
>>> +        /*
>>> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
>>> +         * then something is very wrong with it
>>> +         */
>>> +        assert(!dev->parent_bus);
>>> +
>>> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
>>> +        return;
>>> +    }
>>
>> Extended version of my question above: what are the devices where
>> qdev_get_hotplug_handler(dev) returns null here?
>
> Start a VM: qemu-system-aarch64 -M virt -nographic
> Run the following on its HMP: device_del /machine/unattached/device[0]
>
> It tries to unplug cortex-a15-arm-cpu and crashes.

This device has no parent bus (dev->parent_bus is null), but is marked
hot-pluggable (dc->hotpluggable is true).  Question for somebody
familiar with the hot-plug machinery: is this sane?
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Akihiko Odaki 11 months, 1 week ago
On 2023/12/18 23:02, Markus Armbruster wrote:
> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
> 
>> On 2023/12/11 15:51, Markus Armbruster wrote:
>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>
>>>> The HotplugHandler of the machine will be used when the parent bus does
>>>> not exist, but the machine may not have one. Report an error in such a
>>>> case instead of aborting.
>>>>
>>>> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
>>>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
>>>
>>> Do you have a reproducer for the crash?
>>>
>>>> ---
>>>> Changes in v2:
>>>> - Fixed indention.
>>>> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
>>>> ---
>>>>    system/qdev-monitor.c | 13 ++++++++++---
>>>>    1 file changed, 10 insertions(+), 3 deletions(-)
>>>>
>>>> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
>>>> index a13db763e5..5fe5d49c20 100644
>>>> --- a/system/qdev-monitor.c
>>>> +++ b/system/qdev-monitor.c
>>>> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
>>>    void qdev_unplug(DeviceState *dev, Error **errp)
>>>    {
>>>        DeviceClass *dc = DEVICE_GET_CLASS(dev);
>>>        HotplugHandler *hotplug_ctrl;
>>>        HotplugHandlerClass *hdc;
>>>        Error *local_err = NULL;
>>>        if (qdev_unplug_blocked(dev, errp)) {
>>>            return;
>>>        }
>>>        if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
>>>            error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
>>>            return;
>>>        }
>>>        if (!dc->hotpluggable) {
>>>            error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
>>>                       object_get_typename(OBJECT(dev)));
>>>            return;
>>>        }
>>>        if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
>>>            error_setg(errp, "device_del not allowed while migrating");
>>>            return;
>>>        }
>>>
>>>>       qdev_hot_removed = true;
>>>>          hotplug_ctrl = qdev_get_hotplug_handler(dev);
>>>> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
>>>> -     * then something is very wrong with it */
>>>> -    g_assert(hotplug_ctrl);
>>>> +    if (!hotplug_ctrl) {
>>>> +        /*
>>>> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
>>>> +         * then something is very wrong with it
>>>> +         */
>>>> +        assert(!dev->parent_bus);
>>>> +
>>>> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
>>>> +        return;
>>>> +    }
>>>
>>> Extended version of my question above: what are the devices where
>>> qdev_get_hotplug_handler(dev) returns null here?
>>
>> Start a VM: qemu-system-aarch64 -M virt -nographic
>> Run the following on its HMP: device_del /machine/unattached/device[0]
>>
>> It tries to unplug cortex-a15-arm-cpu and crashes.
> 
> This device has no parent bus (dev->parent_bus is null), but is marked
> hot-pluggable (dc->hotpluggable is true).  Question for somebody
> familiar with the hot-plug machinery: is this sane?

Setting hotpluggable false for each device without bus_type gives the 
same effect, but is error-prone.
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Markus Armbruster 11 months, 1 week ago
Akihiko Odaki <akihiko.odaki@daynix.com> writes:

> On 2023/12/18 23:02, Markus Armbruster wrote:
>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>> 
>>> On 2023/12/11 15:51, Markus Armbruster wrote:
>>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>>
>>>>> The HotplugHandler of the machine will be used when the parent bus does
>>>>> not exist, but the machine may not have one. Report an error in such a
>>>>> case instead of aborting.
>>>>>
>>>>> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
>>>>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
>>>>
>>>> Do you have a reproducer for the crash?
>>>>
>>>>> ---
>>>>> Changes in v2:
>>>>> - Fixed indention.
>>>>> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
>>>>> ---
>>>>>    system/qdev-monitor.c | 13 ++++++++++---
>>>>>    1 file changed, 10 insertions(+), 3 deletions(-)
>>>>>
>>>>> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
>>>>> index a13db763e5..5fe5d49c20 100644
>>>>> --- a/system/qdev-monitor.c
>>>>> +++ b/system/qdev-monitor.c
>>>>> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
>>>>    void qdev_unplug(DeviceState *dev, Error **errp)
>>>>    {
>>>>        DeviceClass *dc = DEVICE_GET_CLASS(dev);
>>>>        HotplugHandler *hotplug_ctrl;
>>>>        HotplugHandlerClass *hdc;
>>>>        Error *local_err = NULL;
>>>>        if (qdev_unplug_blocked(dev, errp)) {
>>>>            return;
>>>>        }
>>>>        if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
>>>>            error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
>>>>            return;
>>>>        }
>>>>        if (!dc->hotpluggable) {
>>>>            error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
>>>>                       object_get_typename(OBJECT(dev)));
>>>>            return;
>>>>        }
>>>>        if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
>>>>            error_setg(errp, "device_del not allowed while migrating");
>>>>            return;
>>>>        }
>>>>
>>>>>       qdev_hot_removed = true;
>>>>>          hotplug_ctrl = qdev_get_hotplug_handler(dev);
>>>>> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
>>>>> -     * then something is very wrong with it */
>>>>> -    g_assert(hotplug_ctrl);
>>>>> +    if (!hotplug_ctrl) {
>>>>> +        /*
>>>>> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
>>>>> +         * then something is very wrong with it
>>>>> +         */
>>>>> +        assert(!dev->parent_bus);
>>>>> +
>>>>> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
>>>>> +        return;
>>>>> +    }
>>>>
>>>> Extended version of my question above: what are the devices where
>>>> qdev_get_hotplug_handler(dev) returns null here?
>>>
>>> Start a VM: qemu-system-aarch64 -M virt -nographic
>>> Run the following on its HMP: device_del /machine/unattached/device[0]
>>>
>>> It tries to unplug cortex-a15-arm-cpu and crashes.
>>
>> This device has no parent bus (dev->parent_bus is null), but is marked
>> hot-pluggable (dc->hotpluggable is true).  Question for somebody
>> familiar with the hot-plug machinery: is this sane?
>
> Setting hotpluggable false for each device without bus_type gives the same effect, but is error-prone.

Having hotpluggable = true when the device cannot be hot-plugged is
*wrong*.  You might be able to paper over the wrongness so the code
works anyway, but nothing good can come out of lying to developers
trying to understand how the code works.

Three ideas to avoid the lying:

1. default hotpluggable to bus_type != NULL.

2. assert(dc->bus_type || !dc->hotpluggable) in a suitable spot.

3. Change the meaning of hotpluggable, and rename it to reflect its new
meaning.  Requires a careful reading of its uses.  I wouldn't go there.
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Akihiko Odaki 9 months, 1 week ago
On 2023/12/20 16:53, Markus Armbruster wrote:
> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
> 
>> On 2023/12/18 23:02, Markus Armbruster wrote:
>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>
>>>> On 2023/12/11 15:51, Markus Armbruster wrote:
>>>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>>>
>>>>>> The HotplugHandler of the machine will be used when the parent bus does
>>>>>> not exist, but the machine may not have one. Report an error in such a
>>>>>> case instead of aborting.
>>>>>>
>>>>>> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
>>>>>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
>>>>>
>>>>> Do you have a reproducer for the crash?
>>>>>
>>>>>> ---
>>>>>> Changes in v2:
>>>>>> - Fixed indention.
>>>>>> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
>>>>>> ---
>>>>>>     system/qdev-monitor.c | 13 ++++++++++---
>>>>>>     1 file changed, 10 insertions(+), 3 deletions(-)
>>>>>>
>>>>>> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
>>>>>> index a13db763e5..5fe5d49c20 100644
>>>>>> --- a/system/qdev-monitor.c
>>>>>> +++ b/system/qdev-monitor.c
>>>>>> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
>>>>>     void qdev_unplug(DeviceState *dev, Error **errp)
>>>>>     {
>>>>>         DeviceClass *dc = DEVICE_GET_CLASS(dev);
>>>>>         HotplugHandler *hotplug_ctrl;
>>>>>         HotplugHandlerClass *hdc;
>>>>>         Error *local_err = NULL;
>>>>>         if (qdev_unplug_blocked(dev, errp)) {
>>>>>             return;
>>>>>         }
>>>>>         if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
>>>>>             error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
>>>>>             return;
>>>>>         }
>>>>>         if (!dc->hotpluggable) {
>>>>>             error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
>>>>>                        object_get_typename(OBJECT(dev)));
>>>>>             return;
>>>>>         }
>>>>>         if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
>>>>>             error_setg(errp, "device_del not allowed while migrating");
>>>>>             return;
>>>>>         }
>>>>>
>>>>>>        qdev_hot_removed = true;
>>>>>>           hotplug_ctrl = qdev_get_hotplug_handler(dev);
>>>>>> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
>>>>>> -     * then something is very wrong with it */
>>>>>> -    g_assert(hotplug_ctrl);
>>>>>> +    if (!hotplug_ctrl) {
>>>>>> +        /*
>>>>>> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
>>>>>> +         * then something is very wrong with it
>>>>>> +         */
>>>>>> +        assert(!dev->parent_bus);
>>>>>> +
>>>>>> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
>>>>>> +        return;
>>>>>> +    }
>>>>>
>>>>> Extended version of my question above: what are the devices where
>>>>> qdev_get_hotplug_handler(dev) returns null here?
>>>>
>>>> Start a VM: qemu-system-aarch64 -M virt -nographic
>>>> Run the following on its HMP: device_del /machine/unattached/device[0]
>>>>
>>>> It tries to unplug cortex-a15-arm-cpu and crashes.
>>>
>>> This device has no parent bus (dev->parent_bus is null), but is marked
>>> hot-pluggable (dc->hotpluggable is true).  Question for somebody
>>> familiar with the hot-plug machinery: is this sane?
>>
>> Setting hotpluggable false for each device without bus_type gives the same effect, but is error-prone.
> 
> Having hotpluggable = true when the device cannot be hot-plugged is
> *wrong*.  You might be able to paper over the wrongness so the code
> works anyway, but nothing good can come out of lying to developers
> trying to understand how the code works.

Hi,

I'm now revisiting this patch and now I think it is still semantically 
correct.

This patch indeed prevents hotplugging a hotpluggable device and that 
may sound irrational. However, we should note that the entity that 
prevents hotplugging is not the device, but the machine that lacks a 
hotplug handler. So we can say the device itself is hotpluggable, but 
the machine is preventing hotplugging.

We already do similar in a case that a device has a parent bus. 
qbus_is_hotpluggable() returns false if the parent bus lacks a hotplug 
handler and prevents from hotplugging a hotpluggable device. The device 
class must still have hotpluggable = true in such a case because another 
instance of device may be plugged into a bus that has a hotplug handler.

I'll submit v3 soon so please check if this reasoning sounds valid for 
you and review it once I submit it.

Regards,
Akihiko Odaki
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Zhao Liu 11 months, 1 week ago
Hi Markus,

On Wed, Dec 20, 2023 at 08:53:21AM +0100, Markus Armbruster wrote:
> Date: Wed, 20 Dec 2023 08:53:21 +0100
> From: Markus Armbruster <armbru@redhat.com>
> Subject: Re: [PATCH v2] qdev: Report an error for machine without
>  HotplugHandler
> 
> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
> 
> > On 2023/12/18 23:02, Markus Armbruster wrote:
> >> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
> >> 
> >>> On 2023/12/11 15:51, Markus Armbruster wrote:
> >>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
> >>>>
> >>>>> The HotplugHandler of the machine will be used when the parent bus does
> >>>>> not exist, but the machine may not have one. Report an error in such a
> >>>>> case instead of aborting.
> >>>>>
> >>>>> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
> >>>>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
> >>>>
> >>>> Do you have a reproducer for the crash?
> >>>>
> >>>>> ---
> >>>>> Changes in v2:
> >>>>> - Fixed indention.
> >>>>> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
> >>>>> ---
> >>>>>    system/qdev-monitor.c | 13 ++++++++++---
> >>>>>    1 file changed, 10 insertions(+), 3 deletions(-)
> >>>>>
> >>>>> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
> >>>>> index a13db763e5..5fe5d49c20 100644
> >>>>> --- a/system/qdev-monitor.c
> >>>>> +++ b/system/qdev-monitor.c
> >>>>> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
> >>>>    void qdev_unplug(DeviceState *dev, Error **errp)
> >>>>    {
> >>>>        DeviceClass *dc = DEVICE_GET_CLASS(dev);
> >>>>        HotplugHandler *hotplug_ctrl;
> >>>>        HotplugHandlerClass *hdc;
> >>>>        Error *local_err = NULL;
> >>>>        if (qdev_unplug_blocked(dev, errp)) {
> >>>>            return;
> >>>>        }
> >>>>        if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
> >>>>            error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
> >>>>            return;
> >>>>        }
> >>>>        if (!dc->hotpluggable) {
> >>>>            error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
> >>>>                       object_get_typename(OBJECT(dev)));
> >>>>            return;
> >>>>        }
> >>>>        if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
> >>>>            error_setg(errp, "device_del not allowed while migrating");
> >>>>            return;
> >>>>        }
> >>>>
> >>>>>       qdev_hot_removed = true;
> >>>>>          hotplug_ctrl = qdev_get_hotplug_handler(dev);
> >>>>> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
> >>>>> -     * then something is very wrong with it */
> >>>>> -    g_assert(hotplug_ctrl);
> >>>>> +    if (!hotplug_ctrl) {
> >>>>> +        /*
> >>>>> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
> >>>>> +         * then something is very wrong with it
> >>>>> +         */
> >>>>> +        assert(!dev->parent_bus);
> >>>>> +
> >>>>> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
> >>>>> +        return;
> >>>>> +    }
> >>>>
> >>>> Extended version of my question above: what are the devices where
> >>>> qdev_get_hotplug_handler(dev) returns null here?
> >>>
> >>> Start a VM: qemu-system-aarch64 -M virt -nographic
> >>> Run the following on its HMP: device_del /machine/unattached/device[0]
> >>>
> >>> It tries to unplug cortex-a15-arm-cpu and crashes.
> >>
> >> This device has no parent bus (dev->parent_bus is null), but is marked
> >> hot-pluggable (dc->hotpluggable is true).  Question for somebody
> >> familiar with the hot-plug machinery: is this sane?
> >
> > Setting hotpluggable false for each device without bus_type gives the same effect, but is error-prone.
> 
> Having hotpluggable = true when the device cannot be hot-plugged is
> *wrong*.  You might be able to paper over the wrongness so the code
> works anyway, but nothing good can come out of lying to developers
> trying to understand how the code works.
> 
> Three ideas to avoid the lying:
> 
> 1. default hotpluggable to bus_type != NULL.
> 
> 2. assert(dc->bus_type || !dc->hotpluggable) in a suitable spot.
> 
> 3. Change the meaning of hotpluggable, and rename it to reflect its new
> meaning.  Requires a careful reading of its uses.  I wouldn't go there.
> 

What about 4 (or maybe 3.1) - droping this hotpluggable flag and just use a
helper (like qbus) to check if device is hotpluggable?

This removes the confusion of that flag and also reduces the number of
configuration items for DeviceState that require developer attention.
A simple helper is as follows:

static inline bool qdev_is_hotpluggable(DeviceState *dev)
{
    /*
     * Many Machines don't implement qdev_hotplug_allowed().
     *
     * TODO: Once all of them complete missing qdev_hotplug_allowed(),
     *       use qdev_hotplug_allowed() here.
     */
    bool hotpluggable = !!qdev_get_machine_hotplug_handler(dev);

    if (!hotpluggable && dev->parent_bus) {
        hotpluggable = qbus_is_hotpluggable(dev->parent_bus);
    }

    return hotpluggable;
}

Thanks,
Zhao
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Akihiko Odaki 11 months, 1 week ago
On 2023/12/21 1:46, Zhao Liu wrote:
> Hi Markus,
> 
> On Wed, Dec 20, 2023 at 08:53:21AM +0100, Markus Armbruster wrote:
>> Date: Wed, 20 Dec 2023 08:53:21 +0100
>> From: Markus Armbruster <armbru@redhat.com>
>> Subject: Re: [PATCH v2] qdev: Report an error for machine without
>>   HotplugHandler
>>
>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>
>>> On 2023/12/18 23:02, Markus Armbruster wrote:
>>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>>
>>>>> On 2023/12/11 15:51, Markus Armbruster wrote:
>>>>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>>>>
>>>>>>> The HotplugHandler of the machine will be used when the parent bus does
>>>>>>> not exist, but the machine may not have one. Report an error in such a
>>>>>>> case instead of aborting.
>>>>>>>
>>>>>>> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
>>>>>>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
>>>>>>
>>>>>> Do you have a reproducer for the crash?
>>>>>>
>>>>>>> ---
>>>>>>> Changes in v2:
>>>>>>> - Fixed indention.
>>>>>>> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
>>>>>>> ---
>>>>>>>     system/qdev-monitor.c | 13 ++++++++++---
>>>>>>>     1 file changed, 10 insertions(+), 3 deletions(-)
>>>>>>>
>>>>>>> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
>>>>>>> index a13db763e5..5fe5d49c20 100644
>>>>>>> --- a/system/qdev-monitor.c
>>>>>>> +++ b/system/qdev-monitor.c
>>>>>>> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
>>>>>>     void qdev_unplug(DeviceState *dev, Error **errp)
>>>>>>     {
>>>>>>         DeviceClass *dc = DEVICE_GET_CLASS(dev);
>>>>>>         HotplugHandler *hotplug_ctrl;
>>>>>>         HotplugHandlerClass *hdc;
>>>>>>         Error *local_err = NULL;
>>>>>>         if (qdev_unplug_blocked(dev, errp)) {
>>>>>>             return;
>>>>>>         }
>>>>>>         if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
>>>>>>             error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
>>>>>>             return;
>>>>>>         }
>>>>>>         if (!dc->hotpluggable) {
>>>>>>             error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
>>>>>>                        object_get_typename(OBJECT(dev)));
>>>>>>             return;
>>>>>>         }
>>>>>>         if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
>>>>>>             error_setg(errp, "device_del not allowed while migrating");
>>>>>>             return;
>>>>>>         }
>>>>>>
>>>>>>>        qdev_hot_removed = true;
>>>>>>>           hotplug_ctrl = qdev_get_hotplug_handler(dev);
>>>>>>> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
>>>>>>> -     * then something is very wrong with it */
>>>>>>> -    g_assert(hotplug_ctrl);
>>>>>>> +    if (!hotplug_ctrl) {
>>>>>>> +        /*
>>>>>>> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
>>>>>>> +         * then something is very wrong with it
>>>>>>> +         */
>>>>>>> +        assert(!dev->parent_bus);
>>>>>>> +
>>>>>>> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
>>>>>>> +        return;
>>>>>>> +    }
>>>>>>
>>>>>> Extended version of my question above: what are the devices where
>>>>>> qdev_get_hotplug_handler(dev) returns null here?
>>>>>
>>>>> Start a VM: qemu-system-aarch64 -M virt -nographic
>>>>> Run the following on its HMP: device_del /machine/unattached/device[0]
>>>>>
>>>>> It tries to unplug cortex-a15-arm-cpu and crashes.
>>>>
>>>> This device has no parent bus (dev->parent_bus is null), but is marked
>>>> hot-pluggable (dc->hotpluggable is true).  Question for somebody
>>>> familiar with the hot-plug machinery: is this sane?
>>>
>>> Setting hotpluggable false for each device without bus_type gives the same effect, but is error-prone.
>>
>> Having hotpluggable = true when the device cannot be hot-plugged is
>> *wrong*.  You might be able to paper over the wrongness so the code
>> works anyway, but nothing good can come out of lying to developers
>> trying to understand how the code works.
>>
>> Three ideas to avoid the lying:
>>
>> 1. default hotpluggable to bus_type != NULL.

I don't have an idea to achieve that. Currently bus_type is set after 
hotpluggable.

>>
>> 2. assert(dc->bus_type || !dc->hotpluggable) in a suitable spot.

It results in abortion and doesn't improve the situation.

>>
>> 3. Change the meaning of hotpluggable, and rename it to reflect its new
>> meaning.  Requires a careful reading of its uses.  I wouldn't go there.

I don't have an idea for such a naming.

So I'm stuck with the current proposal. It suppresses abortion at least. 
Any alternative idea is welcome.

>>
> 
> What about 4 (or maybe 3.1) - droping this hotpluggable flag and just use a
> helper (like qbus) to check if device is hotpluggable?
> 
> This removes the confusion of that flag and also reduces the number of
> configuration items for DeviceState that require developer attention.
> A simple helper is as follows:

Some devices simply doesn't support hotplugging even if the bus 
supports. virtio-gpu-pci doesn't support hotplugging because the display 
infrastructure cannot handle hotplugging, for example.

Regards,
Akihiko Odaki
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Markus Armbruster 11 months, 1 week ago
Akihiko Odaki <akihiko.odaki@daynix.com> writes:

> On 2023/12/21 1:46, Zhao Liu wrote:
>> Hi Markus,
>> On Wed, Dec 20, 2023 at 08:53:21AM +0100, Markus Armbruster wrote:
>>> Date: Wed, 20 Dec 2023 08:53:21 +0100
>>> From: Markus Armbruster <armbru@redhat.com>
>>> Subject: Re: [PATCH v2] qdev: Report an error for machine without
>>>   HotplugHandler
>>>
>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>
>>>> On 2023/12/18 23:02, Markus Armbruster wrote:
>>>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>>>
>>>>>> On 2023/12/11 15:51, Markus Armbruster wrote:
>>>>>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>>>>>
>>>>>>>> The HotplugHandler of the machine will be used when the parent bus does
>>>>>>>> not exist, but the machine may not have one. Report an error in such a
>>>>>>>> case instead of aborting.
>>>>>>>>
>>>>>>>> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
>>>>>>>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
>>>>>>>
>>>>>>> Do you have a reproducer for the crash?
>>>>>>>
>>>>>>>> ---
>>>>>>>> Changes in v2:
>>>>>>>> - Fixed indention.
>>>>>>>> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
>>>>>>>> ---
>>>>>>>>     system/qdev-monitor.c | 13 ++++++++++---
>>>>>>>>     1 file changed, 10 insertions(+), 3 deletions(-)
>>>>>>>>
>>>>>>>> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
>>>>>>>> index a13db763e5..5fe5d49c20 100644
>>>>>>>> --- a/system/qdev-monitor.c
>>>>>>>> +++ b/system/qdev-monitor.c
>>>>>>>> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
>>>>>>>     void qdev_unplug(DeviceState *dev, Error **errp)
>>>>>>>     {
>>>>>>>         DeviceClass *dc = DEVICE_GET_CLASS(dev);
>>>>>>>         HotplugHandler *hotplug_ctrl;
>>>>>>>         HotplugHandlerClass *hdc;
>>>>>>>         Error *local_err = NULL;
>>>>>>>         if (qdev_unplug_blocked(dev, errp)) {
>>>>>>>             return;
>>>>>>>         }
>>>>>>>         if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
>>>>>>>             error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
>>>>>>>             return;
>>>>>>>         }
>>>>>>>         if (!dc->hotpluggable) {
>>>>>>>             error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
>>>>>>>                        object_get_typename(OBJECT(dev)));
>>>>>>>             return;
>>>>>>>         }
>>>>>>>         if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
>>>>>>>             error_setg(errp, "device_del not allowed while migrating");
>>>>>>>             return;
>>>>>>>         }
>>>>>>>
>>>>>>>>        qdev_hot_removed = true;
>>>>>>>>           hotplug_ctrl = qdev_get_hotplug_handler(dev);
>>>>>>>> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
>>>>>>>> -     * then something is very wrong with it */
>>>>>>>> -    g_assert(hotplug_ctrl);
>>>>>>>> +    if (!hotplug_ctrl) {
>>>>>>>> +        /*
>>>>>>>> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
>>>>>>>> +         * then something is very wrong with it
>>>>>>>> +         */
>>>>>>>> +        assert(!dev->parent_bus);
>>>>>>>> +
>>>>>>>> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
>>>>>>>> +        return;
>>>>>>>> +    }
>>>>>>>
>>>>>>> Extended version of my question above: what are the devices where
>>>>>>> qdev_get_hotplug_handler(dev) returns null here?
>>>>>>
>>>>>> Start a VM: qemu-system-aarch64 -M virt -nographic
>>>>>> Run the following on its HMP: device_del /machine/unattached/device[0]
>>>>>>
>>>>>> It tries to unplug cortex-a15-arm-cpu and crashes.
>>>>>
>>>>> This device has no parent bus (dev->parent_bus is null), but is marked
>>>>> hot-pluggable (dc->hotpluggable is true).  Question for somebody
>>>>> familiar with the hot-plug machinery: is this sane?
>>>>
>>>> Setting hotpluggable false for each device without bus_type gives the same effect, but is error-prone.
>>>
>>> Having hotpluggable = true when the device cannot be hot-plugged is
>>> *wrong*.  You might be able to paper over the wrongness so the code
>>> works anyway, but nothing good can come out of lying to developers
>>> trying to understand how the code works.
>>>
>>> Three ideas to avoid the lying:
>>>
>>> 1. default hotpluggable to bus_type != NULL.
>
> I don't have an idea to achieve that. Currently bus_type is set after hotpluggable.
>
>>>
>>> 2. assert(dc->bus_type || !dc->hotpluggable) in a suitable spot.
>
> It results in abortion and doesn't improve the situation.

Oh, it does!  The abort leads us to all the places where we currently
lie (by having dc->hotpluggable = true when it isn't), so we can fix
them.

>>> 3. Change the meaning of hotpluggable, and rename it to reflect its new
>>> meaning.  Requires a careful reading of its uses.  I wouldn't go there.
>
> I don't have an idea for such a naming.
>
> So I'm stuck with the current proposal. It suppresses abortion at least. Any alternative idea is welcome.
>
>>>
>> What about 4 (or maybe 3.1) - droping this hotpluggable flag and just use a
>> helper (like qbus) to check if device is hotpluggable?
>> This removes the confusion of that flag and also reduces the number of
>> configuration items for DeviceState that require developer attention.
>> A simple helper is as follows:
>
> Some devices simply doesn't support hotplugging even if the bus supports. virtio-gpu-pci doesn't support hotplugging because the display infrastructure cannot handle hotplugging, for example.
>
> Regards,
> Akihiko Odaki
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Akihiko Odaki 11 months, 1 week ago
On 2023/12/21 17:49, Markus Armbruster wrote:
> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
> 
>> On 2023/12/21 1:46, Zhao Liu wrote:
>>> Hi Markus,
>>> On Wed, Dec 20, 2023 at 08:53:21AM +0100, Markus Armbruster wrote:
>>>> Date: Wed, 20 Dec 2023 08:53:21 +0100
>>>> From: Markus Armbruster <armbru@redhat.com>
>>>> Subject: Re: [PATCH v2] qdev: Report an error for machine without
>>>>    HotplugHandler
>>>>
>>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>>
>>>>> On 2023/12/18 23:02, Markus Armbruster wrote:
>>>>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>>>>
>>>>>>> On 2023/12/11 15:51, Markus Armbruster wrote:
>>>>>>>> Akihiko Odaki <akihiko.odaki@daynix.com> writes:
>>>>>>>>
>>>>>>>>> The HotplugHandler of the machine will be used when the parent bus does
>>>>>>>>> not exist, but the machine may not have one. Report an error in such a
>>>>>>>>> case instead of aborting.
>>>>>>>>>
>>>>>>>>> Fixes: 7716b8ca74 ("qdev: HotplugHandler: Add support for unplugging BUS-less devices")
>>>>>>>>> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
>>>>>>>>
>>>>>>>> Do you have a reproducer for the crash?
>>>>>>>>
>>>>>>>>> ---
>>>>>>>>> Changes in v2:
>>>>>>>>> - Fixed indention.
>>>>>>>>> - Link to v1: https://lore.kernel.org/r/20231202-bus-v1-1-f7540e3a8d62@daynix.com
>>>>>>>>> ---
>>>>>>>>>      system/qdev-monitor.c | 13 ++++++++++---
>>>>>>>>>      1 file changed, 10 insertions(+), 3 deletions(-)
>>>>>>>>>
>>>>>>>>> diff --git a/system/qdev-monitor.c b/system/qdev-monitor.c
>>>>>>>>> index a13db763e5..5fe5d49c20 100644
>>>>>>>>> --- a/system/qdev-monitor.c
>>>>>>>>> +++ b/system/qdev-monitor.c
>>>>>>>>> @@ -927,9 +927,16 @@ void qdev_unplug(DeviceState *dev, Error **errp)
>>>>>>>>      void qdev_unplug(DeviceState *dev, Error **errp)
>>>>>>>>      {
>>>>>>>>          DeviceClass *dc = DEVICE_GET_CLASS(dev);
>>>>>>>>          HotplugHandler *hotplug_ctrl;
>>>>>>>>          HotplugHandlerClass *hdc;
>>>>>>>>          Error *local_err = NULL;
>>>>>>>>          if (qdev_unplug_blocked(dev, errp)) {
>>>>>>>>              return;
>>>>>>>>          }
>>>>>>>>          if (dev->parent_bus && !qbus_is_hotpluggable(dev->parent_bus)) {
>>>>>>>>              error_setg(errp, QERR_BUS_NO_HOTPLUG, dev->parent_bus->name);
>>>>>>>>              return;
>>>>>>>>          }
>>>>>>>>          if (!dc->hotpluggable) {
>>>>>>>>              error_setg(errp, QERR_DEVICE_NO_HOTPLUG,
>>>>>>>>                         object_get_typename(OBJECT(dev)));
>>>>>>>>              return;
>>>>>>>>          }
>>>>>>>>          if (!migration_is_idle() && !dev->allow_unplug_during_migration) {
>>>>>>>>              error_setg(errp, "device_del not allowed while migrating");
>>>>>>>>              return;
>>>>>>>>          }
>>>>>>>>
>>>>>>>>>         qdev_hot_removed = true;
>>>>>>>>>            hotplug_ctrl = qdev_get_hotplug_handler(dev);
>>>>>>>>> -    /* hotpluggable device MUST have HotplugHandler, if it doesn't
>>>>>>>>> -     * then something is very wrong with it */
>>>>>>>>> -    g_assert(hotplug_ctrl);
>>>>>>>>> +    if (!hotplug_ctrl) {
>>>>>>>>> +        /*
>>>>>>>>> +         * hotpluggable bus MUST have HotplugHandler, if it doesn't
>>>>>>>>> +         * then something is very wrong with it
>>>>>>>>> +         */
>>>>>>>>> +        assert(!dev->parent_bus);
>>>>>>>>> +
>>>>>>>>> +        error_setg(errp, "The machine does not support hotplugging for a device without parent bus");
>>>>>>>>> +        return;
>>>>>>>>> +    }
>>>>>>>>
>>>>>>>> Extended version of my question above: what are the devices where
>>>>>>>> qdev_get_hotplug_handler(dev) returns null here?
>>>>>>>
>>>>>>> Start a VM: qemu-system-aarch64 -M virt -nographic
>>>>>>> Run the following on its HMP: device_del /machine/unattached/device[0]
>>>>>>>
>>>>>>> It tries to unplug cortex-a15-arm-cpu and crashes.
>>>>>>
>>>>>> This device has no parent bus (dev->parent_bus is null), but is marked
>>>>>> hot-pluggable (dc->hotpluggable is true).  Question for somebody
>>>>>> familiar with the hot-plug machinery: is this sane?
>>>>>
>>>>> Setting hotpluggable false for each device without bus_type gives the same effect, but is error-prone.
>>>>
>>>> Having hotpluggable = true when the device cannot be hot-plugged is
>>>> *wrong*.  You might be able to paper over the wrongness so the code
>>>> works anyway, but nothing good can come out of lying to developers
>>>> trying to understand how the code works.
>>>>
>>>> Three ideas to avoid the lying:
>>>>
>>>> 1. default hotpluggable to bus_type != NULL.
>>
>> I don't have an idea to achieve that. Currently bus_type is set after hotpluggable.
>>
>>>>
>>>> 2. assert(dc->bus_type || !dc->hotpluggable) in a suitable spot.
>>
>> It results in abortion and doesn't improve the situation.
> 
> Oh, it does!  The abort leads us to all the places where we currently
> lie (by having dc->hotpluggable = true when it isn't), so we can fix
> them.

I get your point. So we can put the assertion to the point that runs 
after device realization and let such devices break. It's quite 
disruptive but works.
Re: [PATCH v2] qdev: Report an error for machine without HotplugHandler
Posted by Markus Armbruster 11 months, 1 week ago
Zhao Liu <zhao1.liu@intel.com> writes:

> Hi Markus,
>
> On Wed, Dec 20, 2023 at 08:53:21AM +0100, Markus Armbruster wrote:
>> Date: Wed, 20 Dec 2023 08:53:21 +0100
>> From: Markus Armbruster <armbru@redhat.com>
>> Subject: Re: [PATCH v2] qdev: Report an error for machine without
>>  HotplugHandler
>> 
>> Having hotpluggable = true when the device cannot be hot-plugged is
>> *wrong*.  You might be able to paper over the wrongness so the code
>> works anyway, but nothing good can come out of lying to developers
>> trying to understand how the code works.
>> 
>> Three ideas to avoid the lying:
>> 
>> 1. default hotpluggable to bus_type != NULL.
>> 
>> 2. assert(dc->bus_type || !dc->hotpluggable) in a suitable spot.
>> 
>> 3. Change the meaning of hotpluggable, and rename it to reflect its new
>> meaning.  Requires a careful reading of its uses.  I wouldn't go there.
>> 
>
> What about 4 (or maybe 3.1) - droping this hotpluggable flag and just use a
> helper (like qbus) to check if device is hotpluggable?
>
> This removes the confusion of that flag and also reduces the number of
> configuration items for DeviceState that require developer attention.
> A simple helper is as follows:
>
> static inline bool qdev_is_hotpluggable(DeviceState *dev)
> {
>     /*
>      * Many Machines don't implement qdev_hotplug_allowed().
>      *
>      * TODO: Once all of them complete missing qdev_hotplug_allowed(),
>      *       use qdev_hotplug_allowed() here.
>      */
>     bool hotpluggable = !!qdev_get_machine_hotplug_handler(dev);
>
>     if (!hotpluggable && dev->parent_bus) {
>         hotpluggable = qbus_is_hotpluggable(dev->parent_bus);
>     }
>
>     return hotpluggable;
> }

Worth exploring, I think.