From nobody Thu Sep 18 05:42:51 2025
Date: Wed, 3 Sep 2025 21:46:51 -0700
In-Reply-To: <20250904044653.1002362-1-irogers@google.com>
X-Mailing-List: linux-kernel@vger.kernel.org
Mime-Version: 1.0
References: <20250904044653.1002362-1-irogers@google.com>
X-Mailer: git-send-email 2.51.0.338.gd7d06c2dae-goog
Message-ID: <20250904044653.1002362-21-irogers@google.com>
Subject: [PATCH v6 20/22] perf jevents: Add local/remote miss latency metrics for Intel
From: Ian Rogers
To: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo, Namhyung Kim,
 Mark Rutland, Alexander Shishkin, Jiri Olsa, Ian Rogers, Adrian Hunter,
 Kan Liang, James Clark, Xu Yang, linux-kernel@vger.kernel.org,
 linux-perf-users@vger.kernel.org, John Garry, Jing Zhang, Sandipan Das,
 Benjamin Gray, Perry Taylor, Samantha Alt, Caleb Biggers, Weilin Wang,
 Edward Baker, Thomas Falcon
Content-Type: text/plain; charset="utf-8"

Derive the average miss latency from CBOX/CHA TOR occupancy and inserts,
as described in Intel's uncore performance monitoring reference.

Signed-off-by: Ian Rogers
---
 tools/perf/pmu-events/intel_metrics.py | 68 ++++++++++++++++++++++++--
 1 file changed, 65 insertions(+), 3 deletions(-)

diff --git a/tools/perf/pmu-events/intel_metrics.py b/tools/perf/pmu-events/intel_metrics.py
index befaf0fcd961..47e8f1166870 100755
--- a/tools/perf/pmu-events/intel_metrics.py
+++ b/tools/perf/pmu-events/intel_metrics.py
@@ -1,8 +1,9 @@
 #!/usr/bin/env python3
 # SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause)
-from metric import (d_ratio, has_event, max, CheckPmu, Event, JsonEncodeMetric,
-                    JsonEncodeMetricGroupDescriptions, Literal, LoadEvents,
-                    Metric, MetricConstraint, MetricGroup, MetricRef, Select)
+from metric import (d_ratio, has_event, max, source_count, CheckPmu, Event,
+                    JsonEncodeMetric, JsonEncodeMetricGroupDescriptions,
+                    Literal, LoadEvents, Metric, MetricConstraint, MetricGroup,
+                    MetricRef, Select)
 import argparse
 import json
 import math
@@ -612,6 +613,66 @@ def IntelL2() -> Optional[MetricGroup]:
   ], description = "L2 data cache analysis")
 
 
+def IntelMissLat() -> Optional[MetricGroup]:
+  try:
+    ticks = Event("UNC_CHA_CLOCKTICKS", "UNC_C_CLOCKTICKS")
+    data_rd_loc_occ = Event("UNC_CHA_TOR_OCCUPANCY.IA_MISS_DRD_LOCAL",
+                            "UNC_CHA_TOR_OCCUPANCY.IA_MISS",
+                            "UNC_C_TOR_OCCUPANCY.MISS_LOCAL_OPCODE",
+                            "UNC_C_TOR_OCCUPANCY.MISS_OPCODE")
+    data_rd_loc_ins = Event("UNC_CHA_TOR_INSERTS.IA_MISS_DRD_LOCAL",
+                            "UNC_CHA_TOR_INSERTS.IA_MISS",
+                            "UNC_C_TOR_INSERTS.MISS_LOCAL_OPCODE",
+                            "UNC_C_TOR_INSERTS.MISS_OPCODE")
+    data_rd_rem_occ = Event("UNC_CHA_TOR_OCCUPANCY.IA_MISS_DRD_REMOTE",
+                            "UNC_CHA_TOR_OCCUPANCY.IA_MISS",
+                            "UNC_C_TOR_OCCUPANCY.MISS_REMOTE_OPCODE",
+                            "UNC_C_TOR_OCCUPANCY.NID_MISS_OPCODE")
+    data_rd_rem_ins = Event("UNC_CHA_TOR_INSERTS.IA_MISS_DRD_REMOTE",
+                            "UNC_CHA_TOR_INSERTS.IA_MISS",
+                            "UNC_C_TOR_INSERTS.MISS_REMOTE_OPCODE",
+                            "UNC_C_TOR_INSERTS.NID_MISS_OPCODE")
+  except:
+    return None
+
+  if (data_rd_loc_occ.name == "UNC_C_TOR_OCCUPANCY.MISS_LOCAL_OPCODE" or
+      data_rd_loc_occ.name == "UNC_C_TOR_OCCUPANCY.MISS_OPCODE"):
+    data_rd = 0x182
+    for e in [data_rd_loc_occ, data_rd_loc_ins, data_rd_rem_occ, data_rd_rem_ins]:
+      e.name += f"/filter_opc={hex(data_rd)}/"
+  elif data_rd_loc_occ.name == "UNC_CHA_TOR_OCCUPANCY.IA_MISS":
+    # Demand Data Read - Full cache-line read requests from core for
+    # lines to be cached in S or E, typically for data
+    demand_data_rd = 0x202
+    # LLC Prefetch Data - Uncore will first look up the line in the
+    # LLC; for a cache hit, the LRU will be updated, on a miss, the
+    # DRd will be initiated
+    llc_prefetch_data = 0x25a
+    local_filter = (f"/filter_opc0={hex(demand_data_rd)},"
+                    f"filter_opc1={hex(llc_prefetch_data)},"
+                    "filter_loc,filter_nm,filter_not_nm/")
+    remote_filter = (f"/filter_opc0={hex(demand_data_rd)},"
+                     f"filter_opc1={hex(llc_prefetch_data)},"
+                     "filter_rem,filter_nm,filter_not_nm/")
+    for e in [data_rd_loc_occ, data_rd_loc_ins]:
+      e.name += local_filter
+    for e in [data_rd_rem_occ, data_rd_rem_ins]:
+      e.name += remote_filter
+  else:
+    assert data_rd_loc_occ.name == "UNC_CHA_TOR_OCCUPANCY.IA_MISS_DRD_LOCAL", data_rd_loc_occ
+
+  ticks_per_cha = ticks / source_count(data_rd_loc_ins)
+  loc_lat = interval_sec * 1e9 * data_rd_loc_occ / (ticks_per_cha * data_rd_loc_ins)
+  ticks_per_cha = ticks / source_count(data_rd_rem_ins)
+  rem_lat = interval_sec * 1e9 * data_rd_rem_occ / (ticks_per_cha * data_rd_rem_ins)
+  return MetricGroup("lpm_miss_lat", [
+      Metric("lpm_miss_lat_loc", "Local to a socket miss latency in nanoseconds",
+             loc_lat, "ns"),
+      Metric("lpm_miss_lat_rem", "Remote to a socket miss latency in nanoseconds",
+             rem_lat, "ns"),
+  ])
+
+
 def IntelMlp() -> Optional[Metric]:
   try:
     l1d = Event("L1D_PEND_MISS.PENDING")
@@ -981,6 +1042,7 @@ def main() -> None:
       IntelIlp(),
       IntelL2(),
       IntelLdSt(),
+      IntelMissLat(),
       IntelMlp(),
       IntelPorts(),
       IntelSwpf(),
-- 
2.51.0.338.gd7d06c2dae-goog
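
Note for readers (not part of the patch): the loc_lat/rem_lat expressions in
IntelMissLat() can be read as "average TOR residency in cycles" times
"nanoseconds per uncore cycle". The sketch below restates that arithmetic with
plain numbers. The function name miss_latency_ns, its parameters, the
zero guard, and the sample values are all hypothetical and chosen only for
illustration; the patch itself builds a metric expression from Event objects
rather than computing floats.

```python
def miss_latency_ns(interval_sec: float, ticks: float, num_chas: int,
                    occupancy: float, inserts: float) -> float:
    """Plain-number restatement of the loc_lat/rem_lat expression.

    Hypothetical helper for illustration only. `ticks`, `occupancy` and
    `inserts` are counts summed over all CHAs; `num_chas` plays the role
    of source_count() in the patch.
    """
    # Guard added for this sketch; the metric expression has no such check.
    if ticks <= 0 or inserts <= 0 or num_chas <= 0:
        return 0.0
    ticks_per_cha = ticks / num_chas                  # uncore cycles seen by one CHA
    ns_per_tick = interval_sec * 1e9 / ticks_per_cha  # duration of one uncore cycle
    cycles_per_miss = occupancy / inserts             # average TOR residency per miss
    return cycles_per_miss * ns_per_tick


# Made-up sample: 1 s interval, 4 CHAs at 2 GHz each, 100 cycles per miss.
print(miss_latency_ns(1.0, 8e9, 4, 5e8, 5e6))  # prints 50.0
```

With these sample values one uncore cycle lasts 0.5 ns and each miss occupies
a TOR entry for 100 cycles on average, which gives 50 ns.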