From nobody Fri Dec 19 12:36:15 2025 Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.14]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 7BC89148856; Tue, 2 Jul 2024 06:39:50 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=198.175.65.14 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1719902392; cv=none; b=YHna2prgVsPElUED8gJBX0UYUKTZLFaQtr3VSdxZ5D8Mlv4Mp9/5SO0HFlGA6KIU0+8ClpP2S+BBzZGlRO8iVAt0z9ULZCYVhIwjEzNm5AI8WbdV32wJqea4GhcOiMP29HeZQJVDXufb3TOv4sOh0JMWsrC2eu+da2tfcwsnbJs= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1719902392; c=relaxed/simple; bh=zarh+EDgol1sdirsIMupecoTuGmDTB5Wrw+1cz3ZnpI=; h=From:To:Cc:Subject:Date:Message-Id:In-Reply-To:References: MIME-Version; b=EUqsTvkOhpSUaGN6dQUyShTWjsM9LPxyn2srDv3Ez+1I7u/QJ6tt6KKTdLZEUgTzQ+yT2yBxOyA4ghOLu5tuVktid5FC/VX3D9Dax9lcsjMDt+n2TulemE938DSdG+XICbHBH9bkpHz1EdF/J2JH7GJYZzShVbXbwuRClY1XJ+o= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.intel.com; spf=none smtp.mailfrom=linux.intel.com; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b=j16DEaY2; arc=none smtp.client-ip=198.175.65.14 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.intel.com Authentication-Results: smtp.subspace.kernel.org; spf=none smtp.mailfrom=linux.intel.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b="j16DEaY2" DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1719902391; x=1751438391; h=from:to:cc:subject:date:message-id:in-reply-to: references:mime-version:content-transfer-encoding; bh=zarh+EDgol1sdirsIMupecoTuGmDTB5Wrw+1cz3ZnpI=; b=j16DEaY2XIIwvPCYz0pe+ZbKhipQ1wcK7eUXWqphtq5Ary8l8+Qyz2X1 oDl6Zc3xUGBsEy1KsU9HlYdvCg3HKMeB/J86uxyd5Ez72bp9rbYirkG5C 6tlr07gMZ1TvWB7qph2IKs0B6e8tYA2ZSDRc67ixg4rufMJSl6OtnmA3C Sy+2w8hbZw+QMVQu5Y+qAsxj9uH3Qru6B8FykiI3k/jmG1kNaILtmZ5Ek 1C+MOnEgaQgmJ//bsyeDf18pousJxHMfyiaPI0Cb6lp8JhXzXh+eGcnd/ hiEMVQnD05YDWYzRt+nrDkDf1svtrUtcLZobFYuDOTrEPGvNVJgM16HwQ w==; X-CSE-ConnectionGUID: 8t7hLyghQoumICakdwpAHA== X-CSE-MsgGUID: +XhpdHyWS6K4Kx6X31xAfA== X-IronPort-AV: E=McAfee;i="6700,10204,11120"; a="20868308" X-IronPort-AV: E=Sophos;i="6.09,178,1716274800"; d="scan'208";a="20868308" Received: from orviesa009.jf.intel.com ([10.64.159.149]) by orvoesa106.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 01 Jul 2024 23:39:51 -0700 X-CSE-ConnectionGUID: 3X6BaOToQ6SS94fnsXNGMg== X-CSE-MsgGUID: Su7PP3ceTs+JWRYJaIgFXA== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.09,178,1716274800"; d="scan'208";a="45926368" Received: from unknown (HELO dell-3650.sh.intel.com) ([10.239.159.147]) by orviesa009.jf.intel.com with ESMTP; 01 Jul 2024 23:39:47 -0700 From: Dapeng Mi To: Peter Zijlstra , Ingo Molnar , Arnaldo Carvalho de Melo , Namhyung Kim , Ian Rogers , Adrian Hunter , Alexander Shishkin , Kan Liang Cc: linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org, Yanfei Xu , Dapeng Mi , Dapeng Mi Subject: [PATCH 3/4] perf parse-events: Don't move slots event when no topdwon metrics event Date: Wed, 3 Jul 2024 06:40:36 +0800 Message-Id: <20240702224037.343958-4-dapeng1.mi@linux.intel.com> X-Mailer: git-send-email 2.34.1 In-Reply-To: <20240702224037.343958-1-dapeng1.mi@linux.intel.com> References: <20240702224037.343958-1-dapeng1.mi@linux.intel.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" Currently the helper arch_evlist__cmp() would unconditionally move slots event to be the leader event even though there is no topdown metrics event in the group. perf stat -e "{instructions,slots}" -C 0 sleep 1 WARNING: events were regrouped to match PMUs Performance counter stats for 'CPU(s) 0': 27,581,148 slots 8,390,827 instructions 1.045546595 seconds time elapsed This is an overkill. It's not necessary to move slots event as the leader event if there is no topdown metrics event. Thus only regroup events when there are both topdown slots and metrics events in a group. Signed-off-by: Dapeng Mi --- tools/perf/arch/x86/util/evlist.c | 6 ++++-- tools/perf/util/evlist.h | 7 ++++++- tools/perf/util/parse-events.c | 35 ++++++++++++++++++------------- 3 files changed, 31 insertions(+), 17 deletions(-) diff --git a/tools/perf/arch/x86/util/evlist.c b/tools/perf/arch/x86/util/e= vlist.c index 7215c7c7b435..a1e78be6ebd1 100644 --- a/tools/perf/arch/x86/util/evlist.c +++ b/tools/perf/arch/x86/util/evlist.c @@ -73,9 +73,11 @@ int arch_evlist__add_default_attrs(struct evlist *evlist, return ___evlist__add_default_attrs(evlist, attrs, nr_attrs); } =20 -int arch_evlist__cmp(const struct evsel *lhs, const struct evsel *rhs) +int arch_evlist__cmp(const struct evsel *lhs, const struct evsel *rhs, voi= d *priv) { - if (topdown_sys_has_perf_metrics() && + struct sort_priv *_priv =3D priv; + + if (topdown_sys_has_perf_metrics() && _priv->topdown_metrics_in_group && (arch_evsel__must_be_in_group(lhs) || arch_evsel__must_be_in_group(rh= s))) { /* Ensure the topdown slots comes first. */ if (strcasestr(lhs->name, "slots") && !strcasestr(lhs->name, "uops_retir= ed.slots")) diff --git a/tools/perf/util/evlist.h b/tools/perf/util/evlist.h index cb91dc9117a2..14c858dcf5a2 100644 --- a/tools/perf/util/evlist.h +++ b/tools/perf/util/evlist.h @@ -89,6 +89,11 @@ struct evsel_str_handler { void *handler; }; =20 +struct sort_priv { + int force_grouped_idx; + bool topdown_metrics_in_group; +}; + struct evlist *evlist__new(void); struct evlist *evlist__new_default(void); struct evlist *evlist__new_dummy(void); @@ -112,7 +117,7 @@ int arch_evlist__add_default_attrs(struct evlist *evlis= t, #define evlist__add_default_attrs(evlist, array) \ arch_evlist__add_default_attrs(evlist, array, ARRAY_SIZE(array)) =20 -int arch_evlist__cmp(const struct evsel *lhs, const struct evsel *rhs); +int arch_evlist__cmp(const struct evsel *lhs, const struct evsel *rhs, voi= d *priv); =20 int evlist__add_dummy(struct evlist *evlist); struct evsel *evlist__add_aux_dummy(struct evlist *evlist, bool system_wid= e); diff --git a/tools/perf/util/parse-events.c b/tools/perf/util/parse-events.c index 6ed0f9c5581d..a3f7173a7ae2 100644 --- a/tools/perf/util/parse-events.c +++ b/tools/perf/util/parse-events.c @@ -1962,19 +1962,21 @@ static int evsel__compute_group_pmu_name(struct evs= el *evsel, return evsel->group_pmu_name ? 0 : -ENOMEM; } =20 -__weak int arch_evlist__cmp(const struct evsel *lhs, const struct evsel *r= hs) +__weak int arch_evlist__cmp(const struct evsel *lhs, const struct evsel *r= hs, + void *priv __maybe_unused) { /* Order by insertion index. */ return lhs->core.idx - rhs->core.idx; } =20 -static int evlist__cmp(void *_fg_idx, const struct list_head *l, const str= uct list_head *r) +static int evlist__cmp(void *_sort_priv, const struct list_head *l, const = struct list_head *r) { const struct perf_evsel *lhs_core =3D container_of(l, struct perf_evsel, = node); const struct evsel *lhs =3D container_of(lhs_core, struct evsel, core); const struct perf_evsel *rhs_core =3D container_of(r, struct perf_evsel, = node); const struct evsel *rhs =3D container_of(rhs_core, struct evsel, core); - int *force_grouped_idx =3D _fg_idx; + struct sort_priv *sort_priv =3D _sort_priv; + int force_grouped_idx =3D sort_priv->force_grouped_idx; int lhs_sort_idx, rhs_sort_idx, ret; const char *lhs_pmu_name, *rhs_pmu_name; bool lhs_has_group, rhs_has_group; @@ -1992,8 +1994,8 @@ static int evlist__cmp(void *_fg_idx, const struct li= st_head *l, const struct li lhs_sort_idx =3D lhs_core->leader->idx; } else { lhs_has_group =3D false; - lhs_sort_idx =3D *force_grouped_idx !=3D -1 && arch_evsel__must_be_in_gr= oup(lhs) - ? *force_grouped_idx + lhs_sort_idx =3D force_grouped_idx !=3D -1 && arch_evsel__must_be_in_gro= up(lhs) + ? force_grouped_idx : lhs_core->idx; } if (rhs_core->leader !=3D rhs_core || rhs_core->nr_members > 1) { @@ -2001,8 +2003,8 @@ static int evlist__cmp(void *_fg_idx, const struct li= st_head *l, const struct li rhs_sort_idx =3D rhs_core->leader->idx; } else { rhs_has_group =3D false; - rhs_sort_idx =3D *force_grouped_idx !=3D -1 && arch_evsel__must_be_in_gr= oup(rhs) - ? *force_grouped_idx + rhs_sort_idx =3D force_grouped_idx !=3D -1 && arch_evsel__must_be_in_gro= up(rhs) + ? force_grouped_idx : rhs_core->idx; } =20 @@ -2019,16 +2021,17 @@ static int evlist__cmp(void *_fg_idx, const struct = list_head *l, const struct li } =20 /* Architecture specific sorting. */ - return arch_evlist__cmp(lhs, rhs); + return arch_evlist__cmp(lhs, rhs, _sort_priv); } =20 static int parse_events__sort_events_and_fix_groups(struct list_head *list) { - int idx =3D 0, force_grouped_idx =3D -1; struct evsel *pos, *cur_leader =3D NULL; struct perf_evsel *cur_leaders_grp =3D NULL; bool idx_changed =3D false, cur_leader_force_grouped =3D false; int orig_num_leaders =3D 0, num_leaders =3D 0; + struct sort_priv sort_priv =3D {-1, false}; + int idx =3D 0; int ret; =20 /* @@ -2053,13 +2056,17 @@ static int parse_events__sort_events_and_fix_groups= (struct list_head *list) pos->core.idx =3D idx++; =20 /* Remember an index to sort all forced grouped events together to. */ - if (force_grouped_idx =3D=3D -1 && pos =3D=3D pos_leader && pos->core.nr= _members < 2 && - arch_evsel__must_be_in_group(pos)) - force_grouped_idx =3D pos->core.idx; + if (sort_priv.force_grouped_idx =3D=3D -1 && pos =3D=3D pos_leader && + pos->core.nr_members < 2 && arch_evsel__must_be_in_group(pos)) + sort_priv.force_grouped_idx =3D pos->core.idx; + + if (!sort_priv.topdown_metrics_in_group && + strcasestr(pos->name, "topdown")) + sort_priv.topdown_metrics_in_group =3D true; } =20 /* Sort events. */ - list_sort(&force_grouped_idx, list, evlist__cmp); + list_sort(&sort_priv, list, evlist__cmp); =20 /* * Recompute groups, splitting for PMUs and adding groups for events @@ -2070,7 +2077,7 @@ static int parse_events__sort_events_and_fix_groups(s= truct list_head *list) const struct evsel *pos_leader =3D evsel__leader(pos); const char *pos_pmu_name =3D pos->group_pmu_name; const char *cur_leader_pmu_name; - bool pos_force_grouped =3D force_grouped_idx !=3D -1 && + bool pos_force_grouped =3D sort_priv.force_grouped_idx !=3D -1 && arch_evsel__must_be_in_group(pos); =20 /* Reset index and nr_members. */ --=20 2.40.1