Exit when run_perf_stat() returns an error to avoid continuously
repeating the same error message. It's not expected that COUNTER_FATAL
or internal errors are recoverable so there's no point in retrying.
This fixes the following flood of error messages for permission issues,
for example when perf_event_paranoid==3:
perf stat -r 1044 -- false
Error:
Access to performance monitoring and observability operations is limited.
...
Error:
Access to performance monitoring and observability operations is limited.
...
(repeating for 1044 times).
Signed-off-by: Levi Yun <yeoreum.yun@arm.com>
---
Changes in v2:
- Add some comments.
---
tools/perf/builtin-stat.c | 10 +++++++++-
1 file changed, 9 insertions(+), 1 deletion(-)
diff --git a/tools/perf/builtin-stat.c b/tools/perf/builtin-stat.c
index 954eb37ce7b8..0153925f2382 100644
--- a/tools/perf/builtin-stat.c
+++ b/tools/perf/builtin-stat.c
@@ -2875,7 +2875,15 @@ int cmd_stat(int argc, const char **argv)
evlist__reset_prev_raw_counts(evsel_list);
status = run_perf_stat(argc, argv, run_idx);
- if (forever && status != -1 && !interval) {
+ /*
+ * * Meet COUNTER_FATAL situation (i.e) can't open event counter.
+ * * In this case, there is a high chance of failure in the next attempt
+ * * as well with the same reason. so, stop it.
+ * */
+ if (status == -1)
+ break;
+
+ if (forever && !interval) {
print_counters(NULL, argc, argv);
perf_stat__reset_stats();
}
--
LEVI:{C3F47F37-75D8-414A-A8BA-3980EC8A46D7}
On 13/09/2024 03:02, Levi Yun wrote:
> Exit when run_perf_stat() returns an error to avoid continuously
> repeating the same error message. It's not expected that COUNTER_FATAL
> or internal errors are recoverable so there's no point in retrying.
>
> This fixes the following flood of error messages for permission issues,
> for example when perf_event_paranoid==3:
> perf stat -r 1044 -- false
>
> Error:
> Access to performance monitoring and observability operations is limited.
> ...
> Error:
> Access to performance monitoring and observability operations is limited.
> ...
> (repeating for 1044 times).
>
> Signed-off-by: Levi Yun <yeoreum.yun@arm.com>
> ---
> Changes in v2:
> - Add some comments.
> ---
> tools/perf/builtin-stat.c | 10 +++++++++-
> 1 file changed, 9 insertions(+), 1 deletion(-)
>
> diff --git a/tools/perf/builtin-stat.c b/tools/perf/builtin-stat.c
> index 954eb37ce7b8..0153925f2382 100644
> --- a/tools/perf/builtin-stat.c
> +++ b/tools/perf/builtin-stat.c
> @@ -2875,7 +2875,15 @@ int cmd_stat(int argc, const char **argv)
> evlist__reset_prev_raw_counts(evsel_list);
>
> status = run_perf_stat(argc, argv, run_idx);
> - if (forever && status != -1 && !interval) {
> + /*
> + * * Meet COUNTER_FATAL situation (i.e) can't open event counter.
> + * * In this case, there is a high chance of failure in the next attempt
> + * * as well with the same reason. so, stop it.
> + * */
There's something wrong with the formatting here.
But I don't think the comment answers my question about the other return
codes. It just states what the code does.
There are many more return -1's than just for COUNTER_FATAL, so it's not
just that situation anyway. And in addition to that, there is -ENOMEM
and others that aren't -1 which aren't explained that they are
deliberately explicit retry or ignores.
> + if (status == -1)
> + break;
> +
> + if (forever && !interval) {
> print_counters(NULL, argc, argv);
> perf_stat__reset_stats();
> }
> --
> LEVI:{C3F47F37-75D8-414A-A8BA-3980EC8A46D7}
>
On 13/09/2024 09:36, James Clark wrote:
>
>
> On 13/09/2024 03:02, Levi Yun wrote:
>> Exit when run_perf_stat() returns an error to avoid continuously
>> repeating the same error message. It's not expected that COUNTER_FATAL
>> or internal errors are recoverable so there's no point in retrying.
>>
>> This fixes the following flood of error messages for permission issues,
>> for example when perf_event_paranoid==3:
>> perf stat -r 1044 -- false
>>
>> Error:
>> Access to performance monitoring and observability operations is
>> limited.
>> ...
>> Error:
>> Access to performance monitoring and observability operations is
>> limited.
>> ...
>> (repeating for 1044 times).
>>
>> Signed-off-by: Levi Yun <yeoreum.yun@arm.com>
>> ---
>> Changes in v2:
>> - Add some comments.
>> ---
>> tools/perf/builtin-stat.c | 10 +++++++++-
>> 1 file changed, 9 insertions(+), 1 deletion(-)
>>
>> diff --git a/tools/perf/builtin-stat.c b/tools/perf/builtin-stat.c
>> index 954eb37ce7b8..0153925f2382 100644
>> --- a/tools/perf/builtin-stat.c
>> +++ b/tools/perf/builtin-stat.c
>> @@ -2875,7 +2875,15 @@ int cmd_stat(int argc, const char **argv)
>> evlist__reset_prev_raw_counts(evsel_list);
>>
>> status = run_perf_stat(argc, argv, run_idx);
>> - if (forever && status != -1 && !interval) {
>> + /*
>> + * * Meet COUNTER_FATAL situation (i.e) can't open event
>> counter.
>> + * * In this case, there is a high chance of failure in the
>> next attempt
>> + * * as well with the same reason. so, stop it.
>> + * */
>
> There's something wrong with the formatting here.
>
> But I don't think the comment answers my question about the other return
> codes. It just states what the code does.
>
> There are many more return -1's than just for COUNTER_FATAL, so it's not
> just that situation anyway. And in addition to that, there is -ENOMEM
> and others that aren't -1 which aren't explained that they are
> deliberately explicit retry or ignores.
>
If I'm understanding what I think it means, what about something like this:
/*
* Returns -1 for fatal errors which signifies to not continue
* when in repeat mode.
*
* Returns < -1 error codes when stat record is used. These
* result in the stat information being displayed, but writing
* to the file fails and is non fatal.
*/
static int __run_perf_stat(int argc, const char **argv, int run_idx)
{
© 2016 - 2026 Red Hat, Inc.