drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c | 2 -- 1 file changed, 2 deletions(-)
The netdevice usage count increases during transmit queue timeouts
because netdev_hold is called in ndo_tx_timeout, scheduling a task
to reinitialize the card. Although netdev_put is called at the end
of the scheduled work, rtnl_unlock checks the reference count during
cleanup. This could cause issues if transmit timeout is called on
multiple queues. Therefore, netdev_hold and netdev_put have been removed.
Fixes: cb7dd712189f ("octeon_ep_vf: Add driver framework and device initialization")
Signed-off-by: Sathesh B Edara <sedara@marvell.com>
---
Changes:
V3:
- Added more description to commit message
V2:
- Removed redundant call
drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c | 2 --
1 file changed, 2 deletions(-)
diff --git a/drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c b/drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c
index 18c922dd5fc6..5d033bc66bdf 100644
--- a/drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c
+++ b/drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c
@@ -819,7 +819,6 @@ static void octep_vf_tx_timeout_task(struct work_struct *work)
octep_vf_open(netdev);
}
rtnl_unlock();
- netdev_put(netdev, NULL);
}
/**
@@ -834,7 +833,6 @@ static void octep_vf_tx_timeout(struct net_device *netdev, unsigned int txqueue)
{
struct octep_vf_device *oct = netdev_priv(netdev);
- netdev_hold(netdev, NULL, GFP_ATOMIC);
schedule_work(&oct->tx_timeout_task);
}
--
2.36.0
On 4/16/2025 3:25 AM, Sathesh B Edara wrote:
> The netdevice usage count increases during transmit queue timeouts
> because netdev_hold is called in ndo_tx_timeout, scheduling a task
> to reinitialize the card. Although netdev_put is called at the end
> of the scheduled work, rtnl_unlock checks the reference count during
> cleanup. This could cause issues if transmit timeout is called on
> multiple queues. Therefore, netdev_hold and netdev_put have been removed.
>
> Fixes: cb7dd712189f ("octeon_ep_vf: Add driver framework and device initialization")
> Signed-off-by: Sathesh B Edara <sedara@marvell.com>
> ---
> Changes:
> V3:
> - Added more description to commit message
> V2:
> - Removed redundant call
>
> drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c | 2 --
> 1 file changed, 2 deletions(-)
>
> diff --git a/drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c b/drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c
> index 18c922dd5fc6..5d033bc66bdf 100644
> --- a/drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c
> +++ b/drivers/net/ethernet/marvell/octeon_ep_vf/octep_vf_main.c
> @@ -819,7 +819,6 @@ static void octep_vf_tx_timeout_task(struct work_struct *work)
> octep_vf_open(netdev);
> }
> rtnl_unlock();
> - netdev_put(netdev, NULL);
> }
>
> /**
> @@ -834,7 +833,6 @@ static void octep_vf_tx_timeout(struct net_device *netdev, unsigned int txqueue)
> {
> struct octep_vf_device *oct = netdev_priv(netdev);
>
> - netdev_hold(netdev, NULL, GFP_ATOMIC);
> schedule_work(&oct->tx_timeout_task);
> }
I guess the thought was that we need to hold because we scheduled a work
item?
Presumably the driver would simply cancel_work_sync() on this timeout
task before it attempts to release its own reference on the netdev, so
this really doesn't protect anything.
Reviewed-by: Jacob Keller <jacob.e.keller@intel.com>
>
On Wed, 16 Apr 2025 13:26:43 -0700 Jacob Keller wrote:
> > @@ -834,7 +833,6 @@ static void octep_vf_tx_timeout(struct net_device *netdev, unsigned int txqueue)
> > {
> > struct octep_vf_device *oct = netdev_priv(netdev);
> >
> > - netdev_hold(netdev, NULL, GFP_ATOMIC);
> > schedule_work(&oct->tx_timeout_task);
> > }
> I guess the thought was that we need to hold because we scheduled a work
> item?
Looks like something I would have asked them to do :)
But it was probably merged before I could review next version ?
I mean, passing NULL for the tracker is... quite something.
> Presumably the driver would simply cancel_work_sync() on this timeout
> task before it attempts to release its own reference on the netdev, so
> this really doesn't protect anything.
It does, but before unregistering :/
Sathesh, schedule_work() returns a value. You should use it.
© 2016 - 2025 Red Hat, Inc.