[PATCH] module: stats: add lockdep annotation for dup_failed_modules list traversal

Naveen Kumar Chaudhary posted 1 patch 4 days, 15 hours ago
kernel/module/stats.c | 3 ++-
1 file changed, 2 insertions(+), 1 deletion(-)
[PATCH] module: stats: add lockdep annotation for dup_failed_modules list traversal
Posted by Naveen Kumar Chaudhary 4 days, 15 hours ago
read_file_mod_stats() traverses dup_failed_modules with
list_for_each_entry_rcu() while holding module_mutex, but does not pass
the lockdep condition. This triggers a false-positive "RCU-list traversed
in non-reader section" warning with CONFIG_PROVE_RCU_LIST=y.

The warning can be reproduced by:
  1. Racing two loads of the same module to populate dup_failed_modules:
       insmod dummy.ko &
       insmod dummy.ko &
  2. Reading the stats debugfs file:
       cat /sys/kernel/debug/modules/stats

 =============================
 WARNING: suspicious RCU usage
 7.1.0-rc5-gae12a56ba16a #1 Not tainted
 -----------------------------
 kernel/module/stats.c:385 RCU-list traversed in non-reader section!!

 other info that might help us debug this:

 rcu_scheduler_active = 2, debug_locks = 1
 1 lock held by cat/128:
  #0: ffff80008288f7a8 (module_mutex){+.+.}-{4:4}, at: read_file_mod_stats+0x46c/0x5ec

 stack backtrace:
 CPU: 1 UID: 0 PID: 128 Comm: cat Kdump: loaded Not tainted 7.1.0-rc5-gae12a56ba16a #1 PREEMPT
 Hardware name: linux,dummy-virt (DT)
 Call trace:
  show_stack+0x18/0x24 (C)
  __dump_stack+0x28/0x38
  dump_stack_lvl+0x64/0x84
  dump_stack+0x18/0x24
  lockdep_rcu_suspicious+0x134/0x1d4
  read_file_mod_stats+0x554/0x5ec
  full_proxy_read+0xe0/0x1ac
  vfs_read+0xd8/0x2b0
  ksys_read+0x70/0xe4
  __arm64_sys_read+0x1c/0x28
  invoke_syscall+0x48/0xf8
  el0_svc_common+0x8c/0xd8
  do_el0_svc+0x1c/0x28
  el0_svc+0x58/0x1d8
  el0t_64_sync_handler+0x84/0x12c
  el0t_64_sync+0x198/0x19c

The traversal is protected by module_mutex, so pass
lockdep_is_held(&module_mutex) to inform the checker.

Signed-off-by: Naveen Kumar Chaudhary <naveen.osdev@gmail.com>
---
 kernel/module/stats.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/kernel/module/stats.c b/kernel/module/stats.c
index 3a9672f93a8e..a62961acd8e3 100644
--- a/kernel/module/stats.c
+++ b/kernel/module/stats.c
@@ -382,7 +382,8 @@ static ssize_t read_file_mod_stats(struct file *file, char __user *user_buf,
 	mutex_lock(&module_mutex);
 
 
-	list_for_each_entry_rcu(mod_fail, &dup_failed_modules, list) {
+	list_for_each_entry_rcu(mod_fail, &dup_failed_modules, list,
+				lockdep_is_held(&module_mutex)) {
 		if (WARN_ON_ONCE(++count_failed >= MAX_FAILED_MOD_PRINT))
 			goto out_unlock;
 		len += scnprintf(buf + len, size - len, "%25s\t%15lu\t%25s\n", mod_fail->name,
-- 
2.43.0
Re: [PATCH] module: stats: add lockdep annotation for dup_failed_modules list traversal
Posted by Petr Pavlu 3 days, 22 hours ago
On 6/3/26 6:22 PM, Naveen Kumar Chaudhary wrote:
> read_file_mod_stats() traverses dup_failed_modules with
> list_for_each_entry_rcu() while holding module_mutex, but does not pass
> the lockdep condition. This triggers a false-positive "RCU-list traversed
> in non-reader section" warning with CONFIG_PROVE_RCU_LIST=y.
> 
> The warning can be reproduced by:
>   1. Racing two loads of the same module to populate dup_failed_modules:
>        insmod dummy.ko &
>        insmod dummy.ko &
>   2. Reading the stats debugfs file:
>        cat /sys/kernel/debug/modules/stats
> 
>  =============================
>  WARNING: suspicious RCU usage
>  7.1.0-rc5-gae12a56ba16a #1 Not tainted
>  -----------------------------
>  kernel/module/stats.c:385 RCU-list traversed in non-reader section!!
> 
>  other info that might help us debug this:
> 
>  rcu_scheduler_active = 2, debug_locks = 1
>  1 lock held by cat/128:
>   #0: ffff80008288f7a8 (module_mutex){+.+.}-{4:4}, at: read_file_mod_stats+0x46c/0x5ec
> 
>  stack backtrace:
>  CPU: 1 UID: 0 PID: 128 Comm: cat Kdump: loaded Not tainted 7.1.0-rc5-gae12a56ba16a #1 PREEMPT
>  Hardware name: linux,dummy-virt (DT)
>  Call trace:
>   show_stack+0x18/0x24 (C)
>   __dump_stack+0x28/0x38
>   dump_stack_lvl+0x64/0x84
>   dump_stack+0x18/0x24
>   lockdep_rcu_suspicious+0x134/0x1d4
>   read_file_mod_stats+0x554/0x5ec
>   full_proxy_read+0xe0/0x1ac
>   vfs_read+0xd8/0x2b0
>   ksys_read+0x70/0xe4
>   __arm64_sys_read+0x1c/0x28
>   invoke_syscall+0x48/0xf8
>   el0_svc_common+0x8c/0xd8
>   do_el0_svc+0x1c/0x28
>   el0_svc+0x58/0x1d8
>   el0t_64_sync_handler+0x84/0x12c
>   el0t_64_sync+0x198/0x19c
> 
> The traversal is protected by module_mutex, so pass
> lockdep_is_held(&module_mutex) to inform the checker.
> 
> Signed-off-by: Naveen Kumar Chaudhary <naveen.osdev@gmail.com>
> ---
>  kernel/module/stats.c | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)
> 
> diff --git a/kernel/module/stats.c b/kernel/module/stats.c
> index 3a9672f93a8e..a62961acd8e3 100644
> --- a/kernel/module/stats.c
> +++ b/kernel/module/stats.c
> @@ -382,7 +382,8 @@ static ssize_t read_file_mod_stats(struct file *file, char __user *user_buf,
>  	mutex_lock(&module_mutex);
>  
>  
> -	list_for_each_entry_rcu(mod_fail, &dup_failed_modules, list) {
> +	list_for_each_entry_rcu(mod_fail, &dup_failed_modules, list,
> +				lockdep_is_held(&module_mutex)) {
>  		if (WARN_ON_ONCE(++count_failed >= MAX_FAILED_MOD_PRINT))
>  			goto out_unlock;
>  		len += scnprintf(buf + len, size - len, "%25s\t%15lu\t%25s\n", mod_fail->name,

I mentioned in my reply to your previous patch fixing the use of
synchronize_rcu() in the module dups code [1] that the overall RCU usage
in this code appears to be incorrect. I also noticed that Sashiko
reported the same issue. I think it is not very productive to try to fix
these specific RCU-related problems and instead the code should be
properly reworked. It most likely should not be using RCU at all and
kmod_dup_req should instead be reference-counted.

[1] https://lore.kernel.org/linux-modules/ajydyxgaea27rhcopp5eauji24znotu65d2b4uw344yvmwcc6f@7l5re6f2xcuk/

-- 
Thanks,
Petr
Re: [PATCH] module: stats: add lockdep annotation for dup_failed_modules list traversal
Posted by Naveen Kumar Chaudhary 3 days, 14 hours ago
Thanks Petr. Noted.

On Thu 04 Jun 10:40 AM, Petr Pavlu wrote:
> On 6/3/26 6:22 PM, Naveen Kumar Chaudhary wrote:
> > read_file_mod_stats() traverses dup_failed_modules with
> > list_for_each_entry_rcu() while holding module_mutex, but does not pass
> > the lockdep condition. This triggers a false-positive "RCU-list traversed
> > in non-reader section" warning with CONFIG_PROVE_RCU_LIST=y.
> > 
> > The warning can be reproduced by:
> >   1. Racing two loads of the same module to populate dup_failed_modules:
> >        insmod dummy.ko &
> >        insmod dummy.ko &
> >   2. Reading the stats debugfs file:
> >        cat /sys/kernel/debug/modules/stats
> > 
> >  =============================
> >  WARNING: suspicious RCU usage
> >  7.1.0-rc5-gae12a56ba16a #1 Not tainted
> >  -----------------------------
> >  kernel/module/stats.c:385 RCU-list traversed in non-reader section!!
> > 
> >  other info that might help us debug this:
> > 
> >  rcu_scheduler_active = 2, debug_locks = 1
> >  1 lock held by cat/128:
> >   #0: ffff80008288f7a8 (module_mutex){+.+.}-{4:4}, at: read_file_mod_stats+0x46c/0x5ec
> > 
> >  stack backtrace:
> >  CPU: 1 UID: 0 PID: 128 Comm: cat Kdump: loaded Not tainted 7.1.0-rc5-gae12a56ba16a #1 PREEMPT
> >  Hardware name: linux,dummy-virt (DT)
> >  Call trace:
> >   show_stack+0x18/0x24 (C)
> >   __dump_stack+0x28/0x38
> >   dump_stack_lvl+0x64/0x84
> >   dump_stack+0x18/0x24
> >   lockdep_rcu_suspicious+0x134/0x1d4
> >   read_file_mod_stats+0x554/0x5ec
> >   full_proxy_read+0xe0/0x1ac
> >   vfs_read+0xd8/0x2b0
> >   ksys_read+0x70/0xe4
> >   __arm64_sys_read+0x1c/0x28
> >   invoke_syscall+0x48/0xf8
> >   el0_svc_common+0x8c/0xd8
> >   do_el0_svc+0x1c/0x28
> >   el0_svc+0x58/0x1d8
> >   el0t_64_sync_handler+0x84/0x12c
> >   el0t_64_sync+0x198/0x19c
> > 
> > The traversal is protected by module_mutex, so pass
> > lockdep_is_held(&module_mutex) to inform the checker.
> > 
> > Signed-off-by: Naveen Kumar Chaudhary <naveen.osdev@gmail.com>
> > ---
> >  kernel/module/stats.c | 3 ++-
> >  1 file changed, 2 insertions(+), 1 deletion(-)
> > 
> > diff --git a/kernel/module/stats.c b/kernel/module/stats.c
> > index 3a9672f93a8e..a62961acd8e3 100644
> > --- a/kernel/module/stats.c
> > +++ b/kernel/module/stats.c
> > @@ -382,7 +382,8 @@ static ssize_t read_file_mod_stats(struct file *file, char __user *user_buf,
> >  	mutex_lock(&module_mutex);
> >  
> >  
> > -	list_for_each_entry_rcu(mod_fail, &dup_failed_modules, list) {
> > +	list_for_each_entry_rcu(mod_fail, &dup_failed_modules, list,
> > +				lockdep_is_held(&module_mutex)) {
> >  		if (WARN_ON_ONCE(++count_failed >= MAX_FAILED_MOD_PRINT))
> >  			goto out_unlock;
> >  		len += scnprintf(buf + len, size - len, "%25s\t%15lu\t%25s\n", mod_fail->name,
> 
> I mentioned in my reply to your previous patch fixing the use of
> synchronize_rcu() in the module dups code [1] that the overall RCU usage
> in this code appears to be incorrect. I also noticed that Sashiko
> reported the same issue. I think it is not very productive to try to fix
> these specific RCU-related problems and instead the code should be
> properly reworked. It most likely should not be using RCU at all and
> kmod_dup_req should instead be reference-counted.
> 
> [1] https://lore.kernel.org/linux-modules/ajydyxgaea27rhcopp5eauji24znotu65d2b4uw344yvmwcc6f@7l5re6f2xcuk/
> 
> -- 
> Thanks,
> Petr